ANIMAL TISSUE FOR XENOTRANSPLANTATION

TECHNICAL FIELD 

This invention relates generally to the fields of carbohydrate chemistry and animals 
engineered with a genetic knockout. More particularly, it relates to non-human mammals that 
are engineered to be deficient in the enzyme α(1,3)galactosyltransferase, and consequently do
not express the Galα(1,3)Gal xenoantigen.

REFERENCE TO RELATED APPLICATION

This application claims the priority benefit of U.S. Provisional Application 
60/204,148, filed May 15, 2000. The priority application is hereby incorporated herein by 
reference in its entirety. 

BACKGROUND 

The acute shortage of human organs for transplantation is a compelling force towards 
the development of new sources of suitable tissue. One option that is gathering considerable 
attention is the use of organs from other animal species. The main challenge to overcome is 
rendering foreign tissue immunologically compatible with the patient being treated. 

Tissue from most mammalian species would undergo hyperacute rejection when 
transplanted into humans. This is because human plasma contains natural antibodies against 
carbohydrate determinants of the animal tissue, thought to originate through prior immune 
stimulation by dietary antigen or mucosal microflora. Since the antibodies are pre-formed,
rejection occurs within minutes to hours of the transplant.

The main target for the natural antibodies mediating rejection is cell-surface
oligosaccharides expressing the determinant Galα(1,3)Gal (reviewed by Joziasse et al.,
Biochim. Biophys. Acta 1455:403, 1999). Humans, apes and Old World monkeys differ from
other mammals in that they lack α-galactosyl epitopes in complex oligosaccharides. Other
mammals express the Galα(1,3)Gal epitope prominently on the surface of nucleated cells,
including hepatic cells, renal cells, and vascular endothelium, which is especially problematic
for xenotransplantation of whole organs.

The Galα(1,3)Gal epitope is made by a specific enzyme, α(1,3)galactosyltransferase,
abbreviated in this disclosure as α1,3GT. The transferase uses UDP-galactose as a source of
galactose, which it transfers specifically to an acceptor oligosaccharide, usually
Galβ(1,4)GlcNAc (N-acetyl lactosamine). In mammals that don't express the Galα(1,3)Gal
product, the α1,3GT locus is inactivated (Galili et al., Proc. Natl. Acad. Sci. USA 88:7401,
1991). There are frameshift and nonsense mutations within the locus, turning it into a
non-functional, processed pseudogene (Larsen et al., J. Biol. Chem. 265:7055, 1990; Joziasse et
al., J. Biol. Chem. 266:6991, 1991).

In humans, N-acetyl lactosamine acceptor oligosaccharides are processed differently.
The enzyme α(1,2)fucosyltransferase builds the N-acetyl lactosamine into
Fucα(1,2)Galβ(1,4)GlcNAc, which is blood group H substance. This in turn serves as an
acceptor substance for blood group A GalNAc-transferase, or blood group B Gal-transferase,
forming A-substance or B-substance, respectively, depending on the blood type of the
individual. Naturally occurring antibodies circulating in the blood are reactive against the
alternative carbohydrate determinants that are not self-antigens.

Larsen et al. (Proc. Natl. Acad. Sci. USA 86:8227, 1989) isolated and characterized a
cDNA encoding murine α1,3GT. Joziasse et al. (J. Biol. Chem. 267:5534, 1992) detected four
distinct mRNA transcripts, which predict four different isoforms of the α1,3GT. The
full-length mouse mRNA (including 5' untranslated mRNA) was reported to span at least 35-kb of
genomic DNA, distributed over nine exons ranging from 36 base pairs to ~2600 base pairs in
length. Numbering in the 5' to 3' direction, the coding region is distributed over Exons 4 to 9.
The four transcripts are formed by alternative splicing of the pre-mRNA.

Joziasse et al. (J. Biol. Chem. 264:14290, 1989) isolated and characterized a cDNA
encoding bovine α1,3GT. The coding sequence was predicted to encode a membrane-bound protein
with a large glycosylated COOH-terminal domain, a transmembrane domain, and a short
NH2-terminal domain.

The porcine α1,3GT cDNA sequence has been reported from several different
laboratories: Strahan et al. (Immunogenetics 41:101, 1995); U.S. Patent 5,821,117; U.S.
Patent 5,849,991; and International Patent Application WO 95/28412. The genomic
organization of porcine α1,3GT was reported by Katayama et al. (Glycoconjugate J. 15:83,
1998). Again, the coding region spans six exons, conserving the arrangement present in the
mouse genome, and extending over nearly 24-kb.

It has been reported that about 95% of the naturally occurring xenospecific antibody in
humans recognizes the Galα(1,3)Gal epitope (McKenzie et al., Transpl. Immunol. 2:81, 1994).
Antibody in human serum binds specifically to pig endothelial cells in a manner that is
inhibitable by Galα(1,3)Gal, or by Galα(1,6)Glc (melibiose). Old World monkeys have the
same naturally occurring antibody, and demonstrate hyperacute rejection of pig organ
xenotransplants. The rejection reaction can be obviated in experimental animals by infusing
the recipient with the free carbohydrate (Ye et al., Transplantation 58:330, 1994), or by
adsorbing antibody from the circulation on a column of Galα(1,3)Gal or melibiose (Cooper et
al., Xenotransplantation 3:102, 1996).

It has been suggested that xenotransplants of pig tissue could provide a source of
various tissue components: heart valves, pancreatic islet cells, and perhaps large organs such
as livers and kidneys (Cowley, Newsweek, Jan 1/2000). If xenotransplants from
non-primates into humans are ever to become viable, then techniques need to be developed to
suppress Galα(1,3)Gal expression. Possible genetic manipulation strategies are reviewed by
Gustafsson et al. (Immunol. Rev. 141:59, 1994), Sandrin et al. (Frontiers Biosci. 2:e1-11,
1997), and Lavitrano et al. (Forum Genova 9:74, 1999).

One approach is to prevent the formation of Galα(1,3)Gal by providing another
transferase that competes with α1,3GT for the N-acetyl lactosamine acceptor. International
Patent Application WO 97/12035 (Nextran-Baxter) relates to transgenic animals that express
at least one enzyme that masks or reduces the level of the xenoactive antigens by competing
with α1,3GT. The enzymes proposed are α(1,2)fucosyltransferase (that makes H antigen in
humans), α(2,6)sialyltransferase, and β(1,3)N-acetylglucosaminyltransferase. It is thought
that once N-acetyl lactosamine has been converted by one of these transferases, it can no
longer act as an acceptor for α1,3GT. The xenotransplantation cells of Application
WO 97/12035 have at least one enzyme that reduces Galα(1,3)Gal expression, and also
express a complement inhibitor such as CD59, decay accelerating factor (DAF), or membrane
cofactor protein (MCP). Expression of human CD59 in transgenic pig organs enhances organ
survival in an ex vivo xenogeneic perfusion model (Kroshus et al., Transplantation 61:1513,
1996).

Another approach is to disassemble Galα(1,3)Gal after it is formed. International
Patent Application WO 95/33828 (Diacrin) suggests modifying cells for xenogeneic
transplants by treating tissue with an α-glycosidase. Osman et al. (Proc. Natl. Acad. Sci.
USA 94:4677, 1997) reported that combined transgenic expression of both α-glycosidase and
α(1,2)fucosyltransferase leads to optimal reduction in Galα(1,3)Gal epitope. Splenocytes
from mice overexpressing human α-glycosidase showed only a 15-25% reduction in binding of
natural human anti-Galα(1,3)Gal antibodies. Mice overexpressing human
α(1,2)fucosyltransferase as a transgene showed a reduction of Galα(1,3)Gal epitopes by
~90%. Doubly transfected COS cells expressing both the glycosidase and the transferase
showed negligible cell surface staining with anti-Galα(1,3)Gal, and were not susceptible to
lysis by human serum containing antibody and complement.

A further alternative is to prevent Galα(1,3)Gal expression in the first place. Strahan
et al. (Xenotransplantation 2:143, 1995) reported the use of antisense oligonucleotides for
inhibiting pig α1,3GT, leading to a partial reduction in expression of the major target for
human natural antibodies on pig vascular endothelial cells. Hayashi et al. (Transplant. Proc.
29:2213, 1997) reported adenovirus-mediated gene transfer of antisense ribozyme for α1,3GT
and α-fucosyltransferase genes in xenotransplantation.

U.S. Patent 5,849,991 (Bresatec) describes DNA constructs based on the mouse
α1,3GT sequence. They are designed to disrupt expression of functional α1,3GT by
undergoing homologous recombination across Exon 4, 7, 8, or 9. The constructs contain a
selectable marker such as neoR, hygR, or thymidine kinase. It is proposed that such constructs
be introduced into mouse embryonic stem (ES) cells, and that cells in which at least one
α1,3GT gene is inactivated be recovered. Experiments are reported which produced mice that are
homozygous for inactivated α1,3GT, resulting in lack of expression of Galα(1,3)Gal epitope,
as determined by specific antibody.

U.S. Patent 5,821,117 (Austin Research Inst.) reports cDNA sequence data for porcine
α1,3GT. This was used to probe a pig genomic DNA library, and two lambda phage clones
were obtained that contain different regions of the porcine transferase gene.

International Patent Application WO 95/28412 (Biotransplant) also reports cDNA
sequence data for porcine α1,3GT. It is proposed that genomic DNA fragments be isolated
from an isogenic DNA library, and used to develop a gene-targeting cassette including a
positive or negative selectable marker.

International Patent Application WO 99/21415 (Stem Cell Sciences, Biotransplant)
reports construction of a DNA library from miniature swine. A vector is obtained comprising
a pgk-neo cassette, and fragments of the α1,3GT gene. This is used for homologous
recombination to eliminate α1,3GT activity in porcine embryonic fibroblasts.

Costa et al., Alexion Pharmaceuticals (Xenotransplantation 6:6, 1999) report
experiments with transgenic mice expressing the human complement inhibitor CD59, with
mice expressing α(1,2)fucosyltransferase, and with α1,3GT knock-outs. Expression of CD59
in combination with either α(1,2)fucosyltransferase or α1,3GT null phenotype prevented
human serum-mediated cytolysis. Triple combination of all three modifications provided no
additional protective effect.

There have been no reports of the use of α1,3GT inactivated tissue for
xenotransplantation into humans. In view of the paucity of available organs for human 
transplantation, there is a pressing need to develop further options. 

SUMMARY OF THE INVENTION 

Immunologically compatible animal tissue for xenotransplantation is described in this
disclosure. Sequence data for the sheep α(1,3)galactosyltransferase (α1,3GT) gene is
provided, which enables construction of targeting vectors for inactivating the α1,3GT gene.
Successfully targeted cells can be used as nuclear donors for obtaining animals lacking the
Galα(1,3)Gal determinant. The tissues can be transplanted into human patients, without
being subject to hyperacute rejection by antibodies to the Galα(1,3)Gal determinant normally
present in human serum.

One embodiment of the invention is animal tissue devoid of Galα(1,3)Gal
determinants, such as can be detected by antibody binding according to standard techniques.
Exemplary is tissue from ruminants of the subfamily Caprinae, especially sheep. Tissue of
this nature suitable for transplantation to humans includes but is not limited to lung tissue,
kidney tissue, liver tissue, cardiac tissue, pancreatic tissue, and ocular tissue.

This invention also embodies a polynucleotide construct effective for inactivating an
α1,3GT gene. Exemplary constructs comprise at least two polynucleotide sequences from an
ovine α1,3GT gene in a non-natural arrangement, for inactivating the α1,3GT gene by
homologous recombination, leading to deletion or interruption of the α1,3GT encoding
sequence, or replacement with an alternative sequence. Optionally, the vector comprises an
intron sequence of at least 30 consecutive nucleotides homologous to any of the recombinant
phage plasmids illustrated below, or deposited in support of this disclosure. In certain
applications, an α1,3GT gene is inactivated by contacting the cell with the polynucleotide
construct of this invention under conditions suitable for homologous recombination of the
construct into the genome of the cell.

Another embodiment of the invention is a cell that expresses glycosyl transferase
enzymes but does not detectably express α(1,3)galactosyltransferase (α1,3GT). The cell may
be heterozygous or homozygous for inactivation of an α1,3GT gene.

Such cells and tissues can be obtained from a whole animal such as a sheep that is
homozygous for inactivation of an α1,3GT gene. Such animals can be produced by engrafting
into a female of the same species an embryo having at least one inactivated α1,3GT allele.
Embryos can be generated, for example, by transfer of a nucleus from a donor cell with an
inactivated α1,3GT gene, to an enucleated recipient cell. If the birthed animal is heterozygous
for α1,3GT inactivation, homozygous inactivation can be achieved by selective breeding, or
by a second round of gene targeting to inactivate the second allele.

A further embodiment of this invention is an isolated polynucleotide comprising a
sequence of 30 or more consecutive nucleotides contained in (or capable of hybridization with)
SEQ. ID NO:1 or 14 to 25, or which is contained in phage deposited in support of this
application, but which does not appear in other known sequences, such as SEQ. ID NOs: 3, 5,
7, 9, 11, and 13. Included are polynucleotide constructs effective for inactivating an
α(1,3)galactosyltransferase (α1,3GT) gene. Such polynucleotides can also be used in an assay
for determining α1,3GT expression, in which the polynucleotide is combined with analyte
mRNA or cDNA, and the formation of a duplex between the two is correlated with expression
of α1,3GT by the cell.
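
The 30-nucleotide criterion described above can be illustrated computationally. The following is a minimal sketch and not part of the disclosed methods: the function name `distinguishing_windows` and all sequences shown are hypothetical, and exact substring matching is assumed, so the hybridization-based alternative is not modeled.

```python
def distinguishing_windows(candidate, reference, known_sequences, k=30):
    """Return start positions of k-nucleotide windows of `candidate` that occur
    in `reference` but in none of `known_sequences` (exact matching only)."""
    candidate = candidate.upper()
    reference = reference.upper()
    known = [s.upper() for s in known_sequences]
    hits = []
    for start in range(len(candidate) - k + 1):
        window = candidate[start:start + k]
        if window in reference and not any(window in s for s in known):
            hits.append(start)
    return hits


# Toy example with k=8 so the sequences stay readable (all sequences invented).
reference = "ATGGCTAAGCTTGACCGTTAGC"
known = ["ATGGCTAAGCTAGACCGTTAGC"]  # differs from the reference at one position
candidate = "GCTAAGCTTGACCG"
print(distinguishing_windows(candidate, reference, known, k=8))
```

A non-empty result indicates that the candidate contains at least one window found in the reference sequence and absent from the listed known sequences; with the default `k=30` the same check applies at the length stated in the text.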

Another embodiment of the invention is an isolated polypeptide comprising a
sequence of 10 or more consecutive amino acids contained in or homologous to SEQ. ID NO:2,
or encoded in a polynucleotide of this invention, which is not fully contained in any other
known sequence, such as SEQ. ID NOs: 4, 6, 8, 10, and 12. Such polypeptides may have
glycosyltransferase activity, or be immunologically cross-reactive with α1,3GT. In certain
contexts, the polypeptides are used to prepare antibody. In other contexts, the polypeptides
are used to prepare a Galα(1,3)Gal determinant by combining with a galactose acceptor
saccharide in the presence of UDP-galactose.

Further embodiments of the invention are polyclonal and monoclonal antibodies
(including immunoglobulin derivatives and equivalents) that bind specifically to a polypeptide
of this invention, but not with other polypeptides. Some such antibodies react with α1,3GT
of other mammalian species, while others are specific for ovine α1,3GT. They can be used,
for example, in an assay to determine α1,3GT in a sample, in which the sample is combined
with the antibody, and any complex formed is correlated with the presence of α1,3GT.

Cells and tissues of this invention can be used to prepare tissue devoid of
antibody-detectable Galα(1,3)Gal determinants for treatment of the human body by surgery or therapy.
This invention provides a method of xenotransplantation, comprising transplanting tissue
devoid of antibody-detectable Galα(1,3)Gal determinants into a mammal that may have
circulating antibody against Galα(1,3)Gal determinants, such as a human.

These and other embodiments of the invention will be apparent from the description 
that follows. 

BRIEF DESCRIPTION OF THE FIGURES

Figure 1 is a chart providing certain sequence data of this invention, which is
α(1,3)galactosyltransferase cDNA in the sheep (α1,3GT) (SEQ. ID NO:1). The predicted
protein sequence is shown below (SEQ. ID NO:2). Numbers 4 to 9 indicate the projected 5'
boundaries of Exons 4 to 9 in the genomic sequence.

Figure 2 is a chart comparing the cDNA nucleotide sequence of sheep α1,3GT (SEQ.
ID NO:1) with the bovine homolog (SEQ. ID NO:3). Identical residues are marked with an
asterisk (*). The putative start codon is indicated with an arrow.

Figure 3 is a chart comparing the protein sequence of sheep α1,3GT (SEQ. ID NO:2)
with the bovine homolog (SEQ. ID NO:4). Identical residues are marked with an asterisk (*).

Figure 4 is a half-tone reproduction, showing Southern blot analysis of sheep genomic
DNA. Probes for two different α1,3GT exons yield one band, demonstrating that there is one
copy of α1,3GT in the sheep genome.

Figure 5 is a half-tone reproduction, showing Northern blot analysis of mRNA from
sheep fetal fibroblasts. There is a single expressed transcription product for α1,3GT, ~3-kb in
size.

Figure 6 is a restriction map for the sheep α1,3GT gene. Phage clones were obtained
for the regions shown, and used to determine intron sequences. 

Figure 7 is a map showing sequenced intron regions of α1,3GT. The areas numbered
1 to 9 in the figure correspond to SEQ. ID NOs:15, 16, 19, 20, 21, 22, 23, 24, and 25, 
respectively. 

Figure 8 is a map showing the design of two targeting vectors for inactivating the
α1,3GT gene by eliminating one of the exons through homologous recombination. Each
targeting vector comprises a selectable marker (neo), flanked on one side by an intron sequence
of 1-2-kb, and on the other side by an intron sequence of 7-10-kb. A number of vectors have
been obtained, including some that target Exon 4, Exon 8, and Exon 9.

Figures 9 to 11 are drawings that compare the details of the targeting vectors p0054,
p0079, and p0063, respectively, with the region of the α1,3GT gene sequence targeted for
homologous recombination (shown below). These vectors are designed to replace the coding
region in Exon 4 with the selectable marker neo or pac.

Figure 12 is a drawing showing how vectors p0054 and p0079 can be truncated for 
use in an adeno-associated virus (AAV) vector. 

Figure 13 is a drawing that compares targeting vector p0078 (above), with the targeted
Exon 8 region of α1,3GT (below), to be substituted with selectable marker neo.

Figures 14 and 15 are drawings that compare targeting vectors p0047 and p0046,
respectively, with the targeted Exon 9 region of α1,3GT (shown below), to be substituted
with selectable marker neo.

Figure 16 is a half-tone reproduction, showing PCR analysis of sheep fetal fibroblasts
targeted with the p0054 vector. Using two primers for intron sequences that flank Exon 4
(upper panel), the expected product is 2.8-kb for native α1,3GT, and 2.2-kb after homologous
recombination. Using a primer for the selectable marker (lower panel), amplification product
is expected only after recombination. The results show that one of the samples is from a
fibroblast that has successfully been targeted, replacing Exon 4 with the selectable marker.
Since Exon 4 contains the translation initiation site, this would inactivate the α1,3GT gene.

Figure 17 is a half-tone reproduction, showing PCR analysis of umbilical cord taken
from a sheep fetus, generated by transfer of an α1,3GT targeted nucleus to a quiescent oocyte,
which was implanted into a surrogate female at estrus. Two bands were observed: 2.8 kb,
corresponding to the wild type α1,3GT gene, and 2.2 kb, appropriate for a targeted α1,3GT
gene. These results are consistent with inactivation of the α1,3GT gene on one haplotype.
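
The band patterns described for Figures 16 and 17 amount to a simple decision rule. The sketch below is illustrative only: the function name `call_genotype`, the 0.1-kb tolerance, and the returned labels are assumptions introduced here; only the 2.8-kb and 2.2-kb expected product sizes come from the figure descriptions.

```python
WILD_TYPE_KB = 2.8  # expected product for the native gene (flanking-intron primers)
TARGETED_KB = 2.2   # expected product after homologous recombination


def call_genotype(band_sizes_kb, tolerance_kb=0.1):
    """Classify a PCR lane from its observed band sizes (in kb).

    The tolerance and the labels are illustrative assumptions.
    """
    def seen(expected):
        return any(abs(size - expected) <= tolerance_kb for size in band_sizes_kb)

    wild_type, targeted = seen(WILD_TYPE_KB), seen(TARGETED_KB)
    if wild_type and targeted:
        return "heterozygous (one allele targeted)"
    if targeted:
        return "homozygous targeted"
    if wild_type:
        return "wild type"
    return "no call"


print(call_genotype([2.8, 2.2]))  # the two-band pattern described for Figure 17
```

In practice a lane showing only the 2.2-kb band would still call for confirmation, for example with the selectable-marker primer mentioned for the lower panel of Figure 16.
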

DETAILED DESCRIPTION

This invention provides new sequence data for the α(1,3)galactosyltransferase
(α1,3GT) gene in the Black Welsh Mountain sheep. The data includes the cDNA sequence,
encompassing the entire α1,3GT encoding region, and certain portions of the genome
sequence, from upstream of the coding region, from introns, and from downstream of the
coding region.

The discovery of these sequences and nucleic acids containing them has made it possible
to construct targeting vectors designed to disrupt expression of an α1,3GT gene in eukaryotic
cells, particularly sheep cells. Exemplary vectors targeting Exons 4 and 9 of the α1,3GT gene
are shown in Figure 8. These exemplary vectors have a knockout sequence flanked by two
regions homologous to the genomic sequence. After recombination, the knockout sequence
replaces a portion of the gene so as to disrupt transcription of a functioning α1,3GT, by
elimination of Exon 4 or Exon 9, which encompass the translation start and stop codons,
respectively. The knockout sequences in these exemplars comprise a selectable marker that
facilitates the separation of cells that have been genetically altered after targeting with the
vector.

For certain aspects of this invention, mammalian cells that have had an α1,3GT gene
inactivated artificially in at least one haplotype are used to produce animals that do not
express the Galα(1,3)Gal epitope. Nuclear transfer permits a nucleus having an inactivated
α1,3GT gene to be transferred from a donor cell or cell line to an embryonic cell or oocyte.
An embryo is then formed, engrafted into the uterus of a surrogate host, and used to birth a
live neonate. Where the α1,3GT gene is homozygously inactivated, the animal or its progeny
are already able to supply cells and tissue lacking the Galα(1,3)Gal epitope. Where the
α1,3GT gene is inactivated in only one haplotype, homozygous inactivation can be achieved
by standard cross-breeding techniques. Tissue lacking the Galα(1,3)Gal epitope can then be used
for transplantation into humans without risk of antigenicity for the natural antibody that is
normally the linchpin for hyperacute rejection of xenografts.
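
The cross-breeding step can be illustrated with a standard Mendelian calculation. This is a minimal sketch assuming simple autosomal inheritance at a single locus (in line with the single-copy result described for Figure 4); the allele labels `+` and `-` and the function name `offspring_genotypes` are hypothetical.

```python
from collections import Counter
from fractions import Fraction
from itertools import product


def offspring_genotypes(parent_a, parent_b):
    """Expected genotype fractions from crossing two parents given as allele pairs."""
    counts = Counter("".join(sorted(pair)) for pair in product(parent_a, parent_b))
    total = sum(counts.values())
    return {genotype: Fraction(n, total) for genotype, n in counts.items()}


# '+' = functional allele, '-' = inactivated allele (labels are illustrative).
het_cross = offspring_genotypes("+-", "+-")
print(het_cross["--"])  # expected fraction of homozygous inactivated offspring
```

Under these assumptions, crossing two heterozygous animals is expected to yield homozygous inactivated offspring at the usual one-in-four Mendelian frequency.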

Definitions 

For purposes of this disclosure, the term Galα(1,3)Gal (abbreviated GAL) refers to an
oligosaccharide determinant present on endothelial cells and other cells of most non-primate
mammals, for which humans have a naturally occurring antibody. The usual structure is
Galα(1,3)Galβ(1,4)GlcNAc, although other forms of Galα(1,3)Gal specifically detectable by
the naturally occurring anti-Galα(1,3)Gal in human serum of B blood type are included.
Galα(1,3)Gal is distinct from the Galα(1,3)[Fucα(1,2)]Galβ(1,4)GlcNAc determinant
characteristic of the human B blood type antigen.

An "antibody detectable" determinant refers to a determinant that is present in an
amount and is sufficiently accessible so that it can be detected by an antibody specific for the
determinant in an appropriate immunoassay, such as an agglutination reaction, optionally
developed with an antiglobulin reagent, or by immunohistochemistry.

The term "α(1,3)galactosyltransferase" and the abbreviation "α1,3GT" refer to the
enzyme present in non-primate mammals that catalyzes the formation of the Galα(1,3)Gal
determinant by attaching Gal in the α(1,3) position to the Galβ(1,4)GlcNAc acceptor.
α1,3GT has the Enzyme Commission designation EC 2.4.1.124. α1,3GT is not naturally
expressed in humans, and the term does not include the galactosyltransferase that forms the
human B blood group antigen.

An "acceptor" substance for α1,3GT is a carbohydrate structure that can act as a
substrate and become galactosylated as a result of transferase activity. Acceptors for α1,3GT
include both Galβ(1,3)GlcNAc and Galβ(1,4)GlcNAc (Basu et al., J. Biol. Chem. 248:1700,
1973; Blake et al., J. Biol. Chem. 256:5387, 1981).

A transferase is said to be "detectably expressed" by a cell at the mRNA level when
mRNA encoding the transferase can be measured in the cell by some suitable technique, such
as Northern analysis or reverse transcriptase PCR. It may also be expressed at the protein
level, as detected by a specific antibody or demonstration of the characteristic enzymatic
activity. Those of skill in the art will recognize that some cells (such as mature red blood
cells) do not express any glycosyl transferases, even though they display certain
oligosaccharide determinants. Inhibition of α1,3GT expression is only meaningful in cells
capable of expressing other glycosyl transferase enzymes. Most nucleated mammalian cells
can be assumed to have this capability.

A gene is said to be "inactivated" when it is rendered incapable of producing a
functional protein without further genetic engineering. For example, an inactivated gene may 
be missing necessary transcription or translation control elements, it may be lacking an 
essential part of the protein encoding region, or the encoding region may be placed out of 
phase. In a second example, the gene may be interrupted by an inserted sequence, or mutated 
in such a way as to interfere with transcription or translation of the gene product. In a third 
example, the inactivated gene may produce a translation product that has been altered in such a 
way that it lacks important enzymatic activity of the native gene product. 

The term "tissue" refers to a heterogeneous collection of cells responsible for 
maintaining one or more physiological functions. Of interest for certain embodiments of this 
invention are organs suitable for transplantation, such as a whole kidney; however, the term 
also includes organ fragments and other embodiments, such as a piece of connective tissue, or a 
collection of cells in a medical support device. 

The term "embryo" as it is used in this disclosure refers to an organism developing in 
the uterus of a species of interest at any time after fertilization or intrauterine transfer, not 
limited to a particular developmental period. The terms "engrafting" or "transplanting", in 
reference to embryo manipulation, refer to any process known in the art for artificially 
introducing one or more embryos into the uterus of a female animal. 

An "individual" or "subject" refers to any vertebrate animal, usually a member of a
mammalian species, including but not limited to domestic animals, non-human primates, and 
humans. 

It is understood that a clinical or biological "sample" encompasses a variety of sample 
types obtained from a subject and useful in an in vitro procedure, such as a diagnostic test. 
The definition encompasses solid tissue samples obtained during surgery or autopsy, and 
liquid samples such as blood, spinal fluid, and fluid obtained by amniocentesis, and various 
subfractions, enrichments, or solubilized extracts derived from such collections.

A "host cell" denotes a prokaryotic or eukaryotic cell that has been genetically altered, or is
capable of being genetically altered by administration of an exogenous polynucleotide, such as 
a bacterial plasmid or recombinant vector. When referring to genetically altered cells, the term 
refers both to the originally altered cell and to the progeny thereof. 

A cell is said to be "genetically altered" when it contains an artificially introduced 
polynucleotide, or is the progeny of a cell so altered that has inherited an introduced 
polynucleotide. The polynucleotide may contain a sequence that is heterologous to the cell, it 
may contain native sequences in an artificial arrangement (e.g., an encoding region linked to a 
different promoter), or it may provide additional copies of a native encoding sequence. The 
polynucleotide can be introduced by transfection using electroporation or liposome-mediated 
transfer, homologous recombination, transduction using a viral vector, any combination 
thereof, or any other technique known in the art. The polynucleotide will often comprise a 
transcribable sequence encoding a protein of interest, which enables the cell to express the 
protein at an elevated level. In the context of the present application, a "genetically altered 
cell" has a genetic alteration that is inheritable by progeny of the cell. For example, an embryo 
having genetically altered DNA, if carried to term, will give rise to a neonate that has cells 
containing the genetic alteration. 

The terms "polynucleotide" and "oligonucleotide" are used interchangeably to refer to 
a polymeric form of nucleotides of any length. Included are genes and gene fragments, mRNA, 
tRNA, rRNA, ribozymes, cDNA, recombinant polynucleotides, branched polynucleotides,
plasmids, vectors, isolated DNA and RNA, nucleic acid probes, and primers. Also included
are nucleotide analogs, including but not limited to thiol-derivatized nucleosides (U.S. Patent 
5,578,718), oligonucleotides with modified backbones (U.S. Patent Nos. 5,541,307 and 
5,378,825), and peptide nucleic acids (U.S. Patent No. 5,786,461). The term polynucleotide, 
as used in this disclosure, refers interchangeably to double- and single-stranded molecules. 
Unless otherwise specified or required, any embodiment of the invention that is a 
polynucleotide encompasses both a double-stranded form, and each of the two 
complementary single-stranded forms known or predicted to make up the double-stranded 
form. 

Polynucleotide sequences are said to be in a "non-natural arrangement" when they are 
joined together or interposed with another sequence in an arrangement not found in nature. 

When comparison is made between polynucleotides for degree of identity, it is 
implicitly understood that complementary strands are easily generated, and the sense or 
antisense strand is selected or predicted that maximizes the degree of identity between the 
polynucleotides being compared. A computer program that can be used for finding 
homologous sequences is the BLAST algorithm (Example 1). For purposes of the present 
disclosure, percentage of sequence identity is calculated by first aligning the polynucleotide 
being examined with the reference counterpart, and then counting the number of residues 
shared between the sequences being compared as a percentage of the region under examination. 
No penalty is imposed for the presence of insertions or deletions, but insertions or deletions 
are permitted only where clearly required to readjust the alignment. The percentage is given in 
terms of residues in the sequence being examined that are identical to residues in the 
comparison or reference sequence. Particularly desirable polynucleotide sequences preserve 
the function of the prototype: depending on context, an ability to hybridize with a target 
sequence, the function of a polypeptide it may encode, or (for certain gene targeting vectors) 
the ability to facilitate homologous recombination or gene inactivation. 
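
The identity calculation described above can be sketched in a few lines. The sketch assumes the two sequences have already been aligned (for example, taken from BLAST output) and are supplied as equal-length strings with `-` marking gaps. The function name `percent_identity` is hypothetical, and using the non-gap residues of the examined sequence as the denominator is one reading of "the region under examination".

```python
def percent_identity(examined_aligned, reference_aligned, gap="-"):
    """Percent of residues in the examined sequence identical to the aligned reference residue."""
    if len(examined_aligned) != len(reference_aligned):
        raise ValueError("aligned sequences must have the same length")
    examined_residues = 0
    identical = 0
    for a, b in zip(examined_aligned.upper(), reference_aligned.upper()):
        if a == gap:
            continue  # gap in the examined sequence: no residue to score here
        examined_residues += 1
        if a == b:
            identical += 1
    if examined_residues == 0:
        raise ValueError("examined sequence contains no residues")
    return 100.0 * identical / examined_residues


# Invented 10-column alignment: 9 examined residues, 8 of them identical.
print(percent_identity("ATGC-TAGGA", "ATGCCTAGCA"))
```

Columns where the examined sequence has a gap are skipped rather than counted against it, while an examined residue aligned to a gap in the reference is counted as non-identical; other conventions are possible.
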

"Hybridization" refers to a reaction in which one or more polynucleotides react to 
form a complex that is stabilized via hydrogen bonding between the bases of the nucleotide 
residues. The hydrogen bonding can occur by Watson-Crick base pairing, Hoogsteen binding, 
triplex formation, or complexing in any other sequence-specific manner. A hybridization 
reaction will, on occasion, be a step in a more extensive process, such as part of PCR 
amplification. Hybridization reactions can be performed under conditions of different 
"stringency". Conditions that increase the stringency of a hybridization reaction are widely 
known (see e.g., Sambrook et al., infra). Examples of conditions in order of increasing 
stringency: incubation temperatures of 25°C, 37°C, 50°C, and 68°C; buffer concentrations of 
10 x SSC, 6 x SSC, 1 x SSC, 0.1 x SSC (where SSC is 0.15 M NaCl and 15 mM citrate buffer, 
pH 7.2) and their equivalent using other buffer systems; formamide concentrations of 0%, 
25%, 50%, and 75%; incubation times from 5 min to 24 h; 1, 2, or more washing steps; wash 
incubation times of 1, 5, or 15 min; and wash solutions of 6 x SSC, 1 x SSC, 0.1 x SSC, or 
deionized water. Typical conditions of high stringency for the binding of a probe of about 100 
base pairs and above are a hybridization reaction at 65°C in 2 x SSC, followed by repeat washes 
at 0.1 x SSC, or the equivalent combination of solvent and temperature conditions for the 
particular nucleic acids being studied. 
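
As a worked illustration of the buffer strengths listed above, the following Python sketch converts an SSC strength into its salt concentrations, taking 1 x SSC as 0.15 M NaCl and 15 mM citrate as stated. The function name is hypothetical.

```python
# 1 x SSC as defined in the text: 0.15 M NaCl and 15 mM citrate.
NACL_M_PER_X = 0.15
CITRATE_MM_PER_X = 15.0


def ssc_composition(strength_x: float) -> tuple[float, float]:
    """Return (NaCl in M, citrate in mM) for a given SSC strength."""
    return (
        round(strength_x * NACL_M_PER_X, 6),
        round(strength_x * CITRATE_MM_PER_X, 6),
    )


# Buffer strengths listed in the text, in order of increasing stringency.
for strength in (10, 6, 1, 0.1):
    nacl, citrate = ssc_composition(strength)
    print(f"{strength:>4} x SSC: {nacl} M NaCl, {citrate} mM citrate")
```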

A "hybrid" of polynucleotides, or a "complex" formed between any two or more 
components in a biochemical reaction (such as antibody and antigen), refers to a duplex or 
higher-order complex that is sufficiently long-lasting to persist between its formation and 
subsequent detection. 

A "control element" or "control sequence" is a nucleotide sequence involved in an 
interaction of molecules that contributes to the functional regulation of a polynucleotide, 
including replication, duplication, transcription, splicing, translation, or degradation of the 
polynucleotide. 

"Operatively linked" refers to an operative relationship between genetic elements, in 
which the function of one element influences the function of another element. For example, an 
expressible encoding sequence may be operatively linked to control elements such as 
promoters and enhancers that permit transcription, and control elements for translation such 
as initiation sequences, stop codons, and signals for polyadenylation. 

"Heterologous" means derived from a genotypically distinct entity from that of the 
rest of the entity to which it is being compared. For example, a polynucleotide introduced by 
genetic engineering techniques into an animal of a different species is said to be a heterologous 
polynucleotide. A promoter removed from its native coding sequence and operatively linked 
to a coding sequence with which it is not naturally found linked is said to be a heterologous 
promoter. 

The terms "polypeptide", "peptide" and "protein" are used interchangeably in this 
disclosure to refer to polymers of amino acids of any length. The polymer may comprise 
modified amino acids, it may be linear or branched, and it may be interrupted by non-amino 
acids. The terms also encompass an amino acid polymer that has been modified naturally or 
by intervention; for example, disulfide bond formation, glycosylation, lipidation, acetylation, 
and/or phosphorylation. 

Percentage of sequence identity is calculated for polypeptides by first aligning the 
polypeptide being examined with the reference counterpart or prototype, and then counting 
the number of residues shared between the sequences being compared as a percentage of the 
region under examination. No penalty is imposed for the presence of insertions or deletions, 
but insertions or deletions are permitted only where clearly required to readjust the alignment. 
The percentage is given in terms of residues in the sequence being examined that are identical 
to residues in the comparison or reference sequence. Where substitutions are made, 
conservative substitutions (in which one amino acid is substituted by another with similar 
charge, size, hydrophobicity, or aromaticity) are typically better tolerated. Desirable 
sequences preserve the function of the prototype: for example, the enzymatic activity, the 
binding of specific substrates, and the binding of specific antibody as detectable in a standard 
competition inhibition immunoassay. 

The term "antibody" as used in this disclosure refers to both polyclonal and 
monoclonal antibody. The ambit of the term deliberately encompasses not only intact 
immunoglobulin molecules, but also such fragments and genetically engineered derivatives of 
immunoglobulin molecules as may be prepared by techniques known in the art, and which 
retain the binding specificity of the antigen binding site. 

An "immunogenic" compound or composition is capable of stimulating production of a 
specific immunological response when administered to a suitable host, usually a mammal. 

An "isolated" polynucleotide, polypeptide, protein, antibody, or other substance 
refers to a preparation of the substance which is enriched in relation to some of the other 
components that may also be present in the environment where the substance or a similar 
substance is initially obtained or occurs naturally. Thus, for example, an isolated substance 
may be prepared by using a purification technique to enrich it from a source mixture. 
Enrichment can be measured on an absolute basis, such as weight per volume of solution, or it 
can be measured in relation to a second, potentially interfering substance present in the source 
mixture. Enrichments by 2, 10, 100, and 1000 fold achieve improved degrees of purification. 
A substance can also be provided in an isolated state by a process of artificial assembly, such 
as by chemical synthesis or recombinant expression. An "isolated" cell is a cell that has been 
separated from the organism in which it was grown. 
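
The two ways of measuring enrichment described above reduce to simple ratios. The Python sketch below is illustrative only; the function names and the example figures in the tests are hypothetical.

```python
def fold_enrichment(source_conc: float, purified_conc: float) -> float:
    """Absolute enrichment: concentration after purification over concentration before."""
    if source_conc <= 0:
        raise ValueError("source concentration must be positive")
    return purified_conc / source_conc


def relative_fold_enrichment(
    target_source: float,
    interferent_source: float,
    target_purified: float,
    interferent_purified: float,
) -> float:
    """Enrichment measured against a second, potentially interfering substance."""
    ratio_before = target_source / interferent_source
    ratio_after = target_purified / interferent_purified
    return ratio_after / ratio_before
```

For instance, a substance at 0.5 mg/mL in the source mixture and 50 mg/mL after purification is enriched 100 fold on an absolute basis.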

A polynucleotide used in a reaction, such as a probe used in a hybridization reaction or 
a vector used in gene targeting is referred to as "specific" or "selective" if it hybridizes or 
reacts with the intended target more frequently, more rapidly, or with greater duration than it 
does with alternative substances. Similarly, a polypeptide is referred to as "specific" or 
"selective" if it binds an intended target, such as a ligand, hapten, substrate, antibody, or other 
polypeptide more frequently, more rapidly, or with greater duration than it does to alternative 
substances. An antibody is referred to as "specific" or "selective" if it binds via at least one 
antigen recognition site to the intended target more frequently, more rapidly, or with greater 
duration than it does to alternative substances. 

General Techniques 

Unless otherwise noted, the practice of this invention can be carried out by employing 
standard techniques of genetic engineering, protein manipulation, and cell culture. Textbooks 
that describe standard laboratory techniques include "Molecular Cloning: A Laboratory 
Manual", 2nd Ed. (Sambrook et al., 1989); "Oligonucleotide Synthesis" (M.J. Gait, ed., 
1984); "Animal Cell Culture" (R.I. Freshney, ed., 1987); the series "Methods in Enzymology" 
(Academic Press, Inc.); "Gene Transfer Vectors for Mammalian Cells" (J.M. Miller & M.P. 
Calos, eds., 1987); "Current Protocols in Molecular Biology" and "Short Protocols in 
Molecular Biology, 3rd Edition" (F.M. Ausubel et al., eds., 1987 & 1995); and "Recombinant 
DNA Methodology II" (R. Wu ed., Academic Press 1995). Techniques used in raising, 
purifying and modifying antibodies, and the design and execution of immunoassays, are 
described in Handbook of Experimental Immunology (D.M. Weir & C.C. Blackwell, eds.); The 
Immunoassay Handbook (Stockton Press NY, 1994); and R. Masseyeff, W.H. Albert, and 
N.A. Staines, eds., Methods of Immunological Analysis (Weinheim: VCH Verlags GmbH, 
1993). 

Texts that describe reproductive techniques and embryo transfer in animals include 
"Manual of the International Embryo Transfer Society: A procedural guide and general 
information for the use of embryo transfer technology emphasizing sanitary procedures", 3rd 
ed. (Stringfellow et al., Savoy, IL: International Embryo Transfer Society); and 
"Embryo transfer in farm animals: A review of techniques and applications" (K.J. Betteridge, 
ed., Agriculture Canada Monographs No. 16, Ottawa, 1977). 

Polynucleotides 

The polynucleotides of this invention include those containing nucleotide sequences 
which are found within the α1,3GT cDNA sequence, shown in SEQ. ID NO:1, or the genomic 
sequence and its 5' and 3' flanking regions, part of which is shown in SEQ. ID NOs:15 to 25. 
Further sequence of the α1,3GT gene can be obtained by employing standard sequencing 
techniques known in the art to the phage plasmids deposited in support of this application. 

Also included in this invention are polynucleotides containing α1,3GT-like sequence 
that is from naturally occurring allelic variants, synthetic variants, and homologs of α1,3GT 
with a percentage of residues identical to the α1,3GT cDNA or gene sequence, determined as 
described above. It is understood that substitutions, insertions, and deletions can be 
accommodated within a polynucleotide sequence without departing from the spirit of this 
invention. In certain embodiments, the polynucleotide sequences are at least 80%, 90%, 95%, 
98%, or 100% identical to one of the sequences exemplified in this disclosure, in order of 
increasing preference. The length of consecutive residues in the identical or homologous 
sequence compared with the exemplary sequence can be at least about 15, 30, 50, 75, 100, 200 
or 500 residues in order of increasing preference, up to the length of the entire clone, gene, or 
sequence. 

Certain polynucleotides of this invention are distinct from polynucleotides already in 
the hands of the public, including previously known α1,3GT cDNA or genomic sequences 
from other species (e.g., SEQ. ID NO:3, 5, 7, 9 and 11), other glycosyl transferase cDNA and 
gene sequences, and matching fragments from other published sequence data that are 
fortuitously identical over the length of the fragment (for example, certain regions of SEQ. ID 
NO:13). A polynucleotide of this invention can be "distinct" from other polynucleotides 
because of an internal sequence difference (a substitution, deletion, or insertion), or because it 
is defined to encompass additional sequence at either end. Also included in the invention are 
recombinant or synthetic polynucleotides in which an α1,3GT-like sequence is linked to a 
heterologous sequence: for example, a heterologous promoter in an expression vector, 
or a selectable marker such as neo in a targeting vector. 

Particular polynucleotides of this invention are useful for producing polypeptides of 
interest, as nucleotide probes and primers, and as targeting vectors for genetic knock-outs. 
Further description of the characteristics of such constructs is provided in a later section. 

Preparation: 

The polynucleotides of this invention can be prepared by any suitable technique in the 
art. Using the data provided in this disclosure or deduced from the deposited plasmids, 
sequences of less than ~50 base pairs are conveniently prepared by chemical synthesis, either 
through a commercial service or by a known synthetic method, such as the triester method or 
the phosphite method. A suitable method is solid phase synthesis using mononucleoside 
phosphoramidite coupling units (Hirose et al., Tetra. Lett. 19:2449-2452, 1978; U.S. Patent 
No. 4,415,732). 

For use in antisense therapy, polynucleotides can be prepared by chemistry that 
produces compounds suitably stable for the pharmaceutical preparation for which they are 
intended. Non-limiting examples include thiol-derivatized nucleosides (U.S. Patent 
5,578,718) and oligonucleotides with modified backbones (U.S. Patent Nos. 5,541,307 and 
5,378,825). Also of interest in the context of antisense constructs are peptide nucleic acids 
(PNAs). Prototype PNAs have an achiral polyamide backbone consisting of 
N-(2-aminoethyl)glycine units, to which purine and pyrimidine bases are linked, for example, 
by way of a methylene carbonyl linker. PNAs are nuclease and protease resistant, and the 
uncharged nature of the PNA oligomers enhances the stability of PNA-nucleotide duplexes, 
thereby blocking transcription or translation. Uptake into cells can be enhanced by 
conjugating to lipophilic groups, incorporating into liposomes, and introducing an amino acid 
side chain into the PNA backbone. See Soomets et al., Front. Biosci. 4:D782, 1999; U.S. 
Patents 5,539,082, 
5,766,855, 5,786,461, and International Patent Application WO 8/5380L 

Polynucleotides of this invention can also be obtained by PCR amplification of a 
template with the desired sequence. Oligonucleotide primers spanning the desired sequence 
are annealed to the template, elongated by a DNA polymerase, and then melted at higher 
temperature so that the template and elongated oligonucleotides dissociate. The cycle is 
repeated until the desired amount of amplified polynucleotide is obtained (U.S. Patent Nos. 
4,683,195 and 4,683,202). Suitable templates depend on the polynucleotide it is desired to 
obtain. Certain polynucleotides of this invention can be obtained from the deposited plasmids. 
Polynucleotides transcribed into mRNA can be obtained from the cDNA of a suitable cell in 
the desired species or a close relative. Cells that express α1,3GT include fibroblasts and 
endothelial cells. Intron sequences for α1,3GT can be obtained from a genomic DNA library 
of the species of interest. 
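
As a rough illustration of how many times the cycle is repeated, the Python sketch below estimates the number of cycles needed to reach a target copy number. It assumes an idealized model in which each cycle multiplies the product by (1 + efficiency); the efficiency parameter and the function name are assumptions made for illustration and are not taken from the cited patents.

```python
def cycles_needed(initial_copies: float, target_copies: float,
                  efficiency: float = 1.0) -> int:
    """Number of PCR cycles until the product reaches target_copies.

    efficiency = 1.0 models perfect doubling per cycle; real reactions
    fall short of this and eventually plateau (not modeled here).
    """
    if initial_copies <= 0:
        raise ValueError("initial_copies must be positive")
    if not 0 < efficiency <= 1:
        raise ValueError("efficiency must be in (0, 1]")
    copies, cycles = float(initial_copies), 0
    while copies < target_copies:
        copies *= 1.0 + efficiency
        cycles += 1
    return cycles
```

Under perfect doubling, a single template molecule reaches about a thousand copies in ten cycles; lower efficiencies require proportionally more cycles.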

Production scale amounts of large polynucleotides are most conveniently obtained by 
inserting the desired sequence into a suitable cloning vector and reproducing the clone. 
Techniques for nucleotide cloning are given in Sambrook, Fritsch & Maniatis (supra) and in 
U.S. Patent No. 5,552,524. Exemplary cloning and expression methods are illustrated in 
Examples 1 and 2, below. Polynucleotides can be purified by standard techniques in nucleic 
acid chemistry, such as phenol-chloroform extraction, agarose gel electrophoresis, and other 
techniques known in the art, adapted according to the source of the polynucleotide. 

Use of the polynucleotides: 

Polynucleotides of this invention can be used to identify α1,3GT nucleotide sequences 
in a sample of interest for research, diagnostic evaluation, or any other purpose. Generally, 
this will involve preparing a reaction mixture in which a sample suspected of containing an 
α1,3GT-related sequence is contacted with a polynucleotide of this invention under 
conditions that permit the polynucleotide to hybridize specifically with the compound being 
tested for, detecting any stable hybrids that form, and correlating the hybrids with the 
presence of an α1,3GT-related sequence in the sample. The formation of stable hybrids can be 
detected by any suitable method known in the art. For example, the probe sequence can be 
provided with a detectable label such as a radioisotope, a chromophore, or a hapten such as 
avidin to which a signaling reagent can be attached. Alternatively, the reagent polynucleotide 
can be a primer for an amplification reaction in which the amount of product produced 
correlates with the formation of specific hybrids. 

The specificity of the primer and stringency of hybridization conditions are both 
chosen with a view to facilitating detection of sequences of interest, while diminishing false 
positive reactions. Thus, when it is important to distinguish between α1,3GT sequences of 
different species, then stringency conditions should be high, and the reagent polynucleotide 
should be nearly identical to the sequence being tested for. Conditions can be determined 
empirically so that the reagent polynucleotide will hybridize with the α1,3GT sequence being 
tested for but not with α1,3GT sequences from other species. In a more usual application, 
the question to be addressed is whether a sample taken from an animal of a particular species 
expresses autologous α1,3GT, or not. 

For example, the polynucleotides of this invention can be used to determine whether a 
cell or tissue treated to inactivate the endogenous α1,3GT gene expresses α1,3GT at the 
mRNA level. Thus, polynucleotides of this invention are used as probes in mRNA blots or as 
primers in reverse PCR to detect endogenous α1,3GT species. False positives will not arise 
due to expression of α1,3GT from another species, but could arise from cross-reactivity from 
another glycosyl transferase sharing homology with α1,3GT, or with another expressed 
sequence with a fortuitous sequence match. Again, one of skill in the art will know how to 
select the reagent sequence and hybridization conditions based on sequence information and 
empirical testing. Since the degree of sequence identity with other galactosyl transferases is 
typically less than 60%, the practitioner has more latitude in modifying the reagent sequence 
to enhance stability, sensitivity, or for any other purpose. 

Polynucleotides of this invention can also be used to inhibit the transcription or 
translation of α1,3GT in target cells, particularly those of the sheep. Such polynucleotides 
can be in the form of antisense constructs, which in some embodiments bind to α1,3GT 
mRNA and prevent translation. Other polynucleotides of this invention are ribozymes having 
a substrate (α1,3GT mRNA) binding portion, and an enzymatic portion with endonuclease 
activity that cleaves the substrate. The design and use of ribozymes is described in U.S. 
Patent Nos. 4,987,071, 5,766,942, 5,998,193, and 6,025,167. Particular use of α1,3GT 
sequences for the inhibition of porcine α1,3GT synthesis is outlined in Strahan et al. 
(Xenotransplantation 2:143, 1995). The effectiveness of transcription inhibitors can be 
determined by transfecting a cell expressing α1,3GT with a test compound, and measuring any 
alteration of expression by the cell, whether at the level of mRNA expression (e.g., by 
quantitative mRNA blotting or reverse PCR), at the level of protein expression (e.g., by 
immunoassay using anti-α1,3GT), or at the level of Galα(1,3)Gal expression (e.g., by binding 
of a specific antibody or a specific lectin). 

Of particular interest are polynucleotide constructs comprising α1,3GT sequence of 
this invention that can be used for altering an endogenous α1,3GT gene. In general, such 
constructs have a region of at least about 200 base pairs and more typically at least about 1 kb 
that is at least about 95% identical to the genomic sequence of interest, to permit specific 
targeting. The construct will also incorporate a substitution, deletion, or insertion designed to 
modify the targeted gene. 

Also included in this invention are polynucleotides that encode polypeptides of 
interest. Characteristics of the polypeptides of this invention are described in the section that 
follows. For polypeptides that are fragments of naturally occurring α1,3GT, there will be a 
corresponding naturally occurring polynucleotide encoding sequence. Those skilled in the art 
will recognize that because of redundancies in the amino acid code, any polynucleotide that 
encodes a peptide of interest can be used in a translation system to produce the peptide. 
Except where otherwise required, all possible codon combinations that translate into the 
peptide sequence of interest are included in the scope of the invention. 
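
To illustrate the scale of this redundancy, the following Python sketch counts and enumerates the DNA sequences that encode a given peptide. It assumes the standard genetic code; the function names are hypothetical.

```python
from itertools import product
from math import prod

# Standard genetic code, codons ordered with T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODONS_FOR = {}
for (b1, b2, b3), aa in zip(product(BASES, repeat=3), AMINO_ACIDS):
    CODONS_FOR.setdefault(aa, []).append(b1 + b2 + b3)


def count_encodings(peptide: str) -> int:
    """Number of distinct coding sequences for a peptide (one-letter code)."""
    return prod(len(CODONS_FOR[aa]) for aa in peptide.upper())


def encodings(peptide: str):
    """Yield every DNA coding sequence that translates into the peptide."""
    for codons in product(*(CODONS_FOR[aa] for aa in peptide.upper())):
        yield "".join(codons)
```

The count grows multiplicatively with peptide length, so full enumeration is practical only for short peptides; `count_encodings` can be used on its own for longer ones.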

The polynucleotides of this invention can be in the form of an expression vector, in 
which the encoding sequence is operatively linked to control elements for transcription and 
translation in a prokaryotic or eukaryotic host cell of interest. Further details of expression 
systems for in vitro peptide production are provided below. 

Also contemplated in this invention are constructs for introducing a transgene into a 
eukaryotic cell, for purposes of expressing α1,3GT, and potentially forming the Galα(1,3)Gal 
determinant on cells that would not otherwise present it. Suitable promoters include the 
endogenous α1,3GT promoter, predicted to be present in the α1,3GT genomic sequence 
upstream from the translation start site. Other suitable promoters include constitutive 
promoters such as those for SV40 and CMV. J.H. White (Adv. Pharmacol. 40:339, 1997) 
describes modified steroid receptors and steroid-inducible promoters as genetic switches for 
gene therapy. Walther et al. (J Mol Med. 74:379, 1996) describe cell-type specific and 
inducible promoters for vectors in gene therapy as an approach for cell targeting. 

In particular embodiments, the transgene is inserted into the genome of an embryonic 
cell (or a nucleus subsequently transferred to an embryonic cell). U.S. Patent No. 4,873,191 
describes a process in which genetic material is introduced into a zygote so that it is 
genetically transformed, and the embryo is then transplanted into a female so that it 
develops to term. International Patent Application WO 99/58703 reports a method of 
preparing transgenic sheep by microinjecting an expression system into a fertilized oosperm, 
identifying embryonic cells using a nested PCR technique, and then transferring a multicellular 
embryo into a suitable host. U.S. Patent No. 5,700,671 reports a method for producing 
mammals with a transgene that encodes a heterologous glycosyltransferase. Other techniques 
useful in preparing transgenic animals can be found, for example, in U.S. Patent Nos. 
4,736,666, 5,741,957, and 5,942,435. Individuals with an expressible α1,3GT transgene will 
not have naturally occurring antibodies against Galα(1,3)Gal, and should accommodate a 
xenotransplant from other animals expressing α1,3GT. High α1,3GT activity in a transgenic 
cell will also decrease the availability of N-acetyl lactosamine acceptor carbohydrate for other 
glycosyltransferases using the same acceptor, shifting the balance of synthesized 
oligosaccharide structures away from the products of other transferases. 

Polypeptides 

The polypeptides of this invention include those that comprise amino acid sequences 
encoded within any of the polynucleotides of this invention, exemplified by SEQ. ID NO:2 
and fragments thereof. Also included in this invention are polypeptides containing 
α1,3GT-like sequence that is from naturally occurring allelic variants, synthetic variants, and 
homologs of α1,3GT with a percentage of residues identical to the α1,3GT protein, calculated 
as described earlier. 

It is understood that substitutions, insertions, and deletions can be accommodated 
within a protein sequence without departing from the spirit of this invention. Conservative 
substitutions are typically more tolerable, such as the substitution of charged amino acids with 
amino acids having the same charge, or substituting aromatic or lipophilic amino acids with 
others having similar features. Certain peptides of this invention are at least 60%, 80%, 90%, 
95%, or 100% identical to one of the sequences exemplified in this disclosure, in order of 
increasing preference. The length of the identical or homologous sequence compared with the 
prototype polypeptide can be about 7, 10, 15, 20, 30, 50 or 100 residues in order of increasing 
preference, up to the length of the entire protein. 

Certain peptides of this invention are distinct from peptides already in the hands of 
the public, including previously known α1,3GT proteins and non-functional homologs from 
other species (e.g., SEQ. ID NOs:4, 6, 8, 10, and 12), and other glycosyl transferases, such as 
the human A and B blood group transferases. A polypeptide of this invention can be 
"distinct" from other polypeptides because of an internal sequence difference (a substitution, 
deletion, or insertion), or because it is defined to encompass additional sequence at either end. 
Also included in the invention are artificially engineered fusion proteins in which an 
α1,3GT-like sequence is linked to a heterologous sequence which modulates α1,3GT activity, 
provides a complementary function, acts as a tag for purposes of labeling or affinity 
purification, or has any other desirable purpose. 

Particular peptides of this invention are useful for their galactose transferase activity, 
for drug screening, and for raising specific antibody, as described below. mRNA splice 
variants of mouse α1,3GT have been observed that omit Exons 5 and 6, and may still produce 
functional enzyme. If the ovine α1,3GT is analogous, then the catalytic activity probably 
resides somewhere else in the molecule. It is predicted that the catalytic activity of α1,3GT 
resides further towards the -COOH terminus, probably at least partly in Exon 9. 

Preparation: 

Short polypeptides of this invention can be prepared by solid-phase chemical 
synthesis. The principles of solid phase chemical synthesis can be found in Dugas & Penney, 
Bioorganic Chemistry, Springer-Verlag NY pp 54-92 (1981), and U.S. Patent No. 4,493,795. 
Automated solid-phase peptide synthesis can be performed using devices such as a 
PE-Applied Biosystems 430A peptide synthesizer (commercially available from Applied 
Biosystems, Foster City CA). 

Longer polypeptides are conveniently obtained by translation in an in vitro translation 
system, or by expression in a suitable host cell. To produce an expression vector, a 
polynucleotide encoding the desired polypeptide is operably linked to control elements for 
transcription and translation, and then transfected into a suitable host cell. Expression may be 
effected in prokaryotes such as E. coli (ATCC Accession No. 31446 or 27325), eukaryotic 
microorganisms such as the yeast Saccharomyces cerevisiae, or higher eukaryotes, such as 
insect or mammalian cells. Control elements such as the promoter are chosen to permit 
translation at an acceptable rate under desired conditions. A number of expression systems 
suitable for producing the peptides of this invention are described in U.S. Patent No. 
5,552,524. Expression cloning is available from such commercial services as Lark 
Technologies, Houston TX. 

Following production, the protein is typically purified from the producing host cell by 
standard methods in protein chemistry in an appropriate combination, which may include ion 
exchange chromatography, affinity chromatography, and HPLC. Expression products are 
optionally produced with a sequence tag to facilitate affinity purification, which can 
subsequently be removed by proteolytic cleavage. 

Use of the polypeptides: 

Polypeptides of this invention can be used for a number of worthwhile purposes, 
including but not limited to the production of Galα(1,3)Gal-containing synthetic 
oligosaccharides, for drug screening, and for production of antibody. 

At least three activities are present in the native α1,3GT: binding of the substrate 
UDP-gal, binding of the acceptor oligosaccharide N-acetyl lactosamine, and catalytic activity 
whereby the acceptor oligosaccharide is galactosylated. The binding site for each acceptor can 
be determined by contacting a test fragment or homolog with the substrate, or with a 
competitive inhibitor that would be expected to bind with higher affinity, and determining 
whether a complex forms by a suitable separation technique, such as HPLC, or by an 
equilibrium quantitation technique, such as microchip analysis, which will better detect low- 
affinity reactions. 

The site of catalytic activity can be determined by contacting the test peptide with 
both UDP-galactose and acceptor oligosaccharide, and measuring any Galα(1,3)Gal formed, 
for example, by immunoassay. Optionally, a systematic approach can be used to determine 
functional regions and homologs of α1,3GT according to any of these assays. For example, 
the assay system is confirmed on the native α1,3GT structure; then a series of nested 
fragments is tested to determine the minimum fragment that provides the same activity. 
Similarly, substitutions can be introduced into the sequence until the activity is ablated, 
thereby determining what residues are critical. Henion et al. (Glycobiology 4:193, 1994) 
characterized the minimal size for catalytically active α1,3GT in New World monkeys, finding 
that ~68 amino acids could be removed from the N-terminal end before losing activity, while 
removal of only ~3 amino acids from the C-terminal end resulted in loss of activity. 
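
The nested-fragment approach can be sketched as follows. This is an illustrative Python outline only: the function names are hypothetical, and `is_active` stands in for whichever laboratory assay (for example, the immunoassay for Galα(1,3)Gal formation) is used to score each fragment.

```python
from typing import Callable, Iterable, Optional, Tuple


def n_terminal_truncations(seq: str, step: int, max_removed: int):
    """Nested fragments made by removing residues from the N-terminal end."""
    return [(n, seq[n:]) for n in range(0, max_removed + 1, step)]


def c_terminal_truncations(seq: str, step: int, max_removed: int):
    """Nested fragments made by removing residues from the C-terminal end."""
    return [(n, seq[: len(seq) - n]) for n in range(0, max_removed + 1, step)]


def max_tolerated_truncation(
    fragments: Iterable[Tuple[int, str]],
    is_active: Callable[[str], bool],
) -> Optional[int]:
    """Largest number of removed residues before activity is first lost.

    Returns None if even the untruncated sequence scores as inactive.
    """
    tolerated = None
    for removed, fragment in fragments:
        if not is_active(fragment):
            break
        tolerated = removed
    return tolerated
```

The same loop applies to a substitution series, with each variant scored by the assay in place of each truncation.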

Native α1,3GT, and fragments and homologs that preserve substrate binding or 
catalytic activity, can be used to screen for activity inhibitors and enhancers. For example, a 
reaction mixture can be prepared containing UDP-galactose, a carbohydrate acceptor, a 
peptide having α1,3GT activity, and a test compound. The rate of formation of Galα(1,3)Gal 
is then measured. When compared to a reaction mixture not containing the test compound, a 
decrease in activity correlates with inhibitory capacity of the test compound, while an increase 
in activity correlates with augmentation capacity. 
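
The comparison against the control reaction can be expressed as a percentage change in rate. The Python sketch below is illustrative; the function names and the 10% threshold for calling an effect are assumptions, and in practice the threshold would be set from the variability of the assay.

```python
def percent_change(control_rate: float, test_rate: float) -> float:
    """Change in Galα(1,3)Gal formation rate relative to the control, in percent."""
    if control_rate <= 0:
        raise ValueError("control rate must be positive")
    return 100.0 * (test_rate - control_rate) / control_rate


def classify(control_rate: float, test_rate: float, threshold_pct: float = 10.0):
    """Label a test compound by its effect on the reaction rate."""
    change = percent_change(control_rate, test_rate)
    if change <= -threshold_pct:
        label = "inhibitor"
    elif change >= threshold_pct:
        label = "enhancer"
    else:
        label = "no effect"
    return label, change
```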

Native α1,3GT, and fragments and homologs with α1,3GT activity are also of interest 
for the artificial production of complex oligosaccharides containing a Galα(1,3)Gal 
determinant, or which build upon the Galα(1,3)Gal linkage. Such structures may be employed 
for any worthwhile purpose. For example, oligosaccharides that terminate in Galα(1,3)Gal are 
of interest for use in the context of minimizing a rejection reaction in an individual 
xenotransplanted with a Galα(1,3)Gal-expressing antigen. Substances containing a high 
density of the determinant can be used as an adsorbent to remove naturally occurring 
anti-Galα(1,3)Gal antibodies from plasma by ex vivo recirculation, or by adsorption in situ 
(see WO 98/42750). Certain Galα(1,3)Gal polymers, built into structures that have T-cell 
inhibitory epitopes, are proposed as immunosuppressants for the natural antibody against 
Galα(1,3)Gal. Catalytic α1,3GT peptides can optionally be adapted with amino acid 
additions, deletions, or substitutions for any worthwhile purpose, such as to improve stability 
or modify the specificity of the transferase in any way that is desirable. 

Peptides of this invention are also of interest for the preparation and testing of 
antibodies against α1,3GT. A polyclonal antiserum raised against intact α1,3GT can be 
screened to map immunogenic portions of the primary sequence. To do this, sequential 
peptides about 12 residues long are synthesized that cover the entire protein (SEQ. ID NO:2), 
and overlapping by about 8 residues. The peptides can be prepared on a nylon membrane 
support by standard Fmoc chemistry, using a SPOTS™ kit from Genosys according to 
manufacturer's directions. Prepared membranes are overlaid with the antiserum, washed, and 
overlaid with β-galactosidase conjugated anti-immunoglobulin. Positive staining identifies 
antigenic regions, which, in an appropriate context, may themselves be immunogenic. There 
will also be antibodies that span different parts of the primary structure, or which rely on a 
conformational component not displayed in smaller peptides. 
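
The set of sequential peptides used for such a scan can be generated as sketched below in Python. The function name is hypothetical, and adding a final peptide flush with the C-terminus when the regular windows stop short is an assumed way of ensuring the entire protein is covered.

```python
def overlapping_peptides(protein: str, length: int = 12, overlap: int = 8):
    """Sequential peptides of a fixed length, each overlapping the previous one."""
    if not 0 <= overlap < length:
        raise ValueError("overlap must be non-negative and smaller than length")
    if not protein:
        return []
    if len(protein) <= length:
        return [protein]
    step = length - overlap
    starts = list(range(0, len(protein) - length + 1, step))
    # Add a last window flush with the C-terminus if residues remain uncovered.
    if starts[-1] + length < len(protein):
        starts.append(len(protein) - length)
    return [protein[s:s + length] for s in starts]
```

With 12-residue peptides overlapping by 8, each new peptide advances 4 residues along the sequence, so a protein of N residues needs roughly N/4 membrane spots.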

Suitable methods for raising and testing α1,3GT antibodies are described in the 
following section. 

Antibodies 

Antibody molecules of this invention include those that are specific for any novel 
peptide encompassed in this disclosure. These antibodies are useful for a number of 
purposes, including assaying for the expression of α1,3GT, and purification of α1,3GT 
peptides by affinity purification. 

Polyclonal antibodies can be prepared by injecting a vertebrate with a polypeptide of 
this invention in an immunogenic form. If needed, immunogenicity of a polypeptide can be 
enhanced by linking to a carrier such as KLH, or combining with an adjuvant, such as Freund's 
adjuvant. Typically, a priming injection is followed by a booster injection after about 4 
weeks, and antiserum is harvested a week later. If desired, the specific antibody activity can 
be further purified by a combination of techniques, which may include Protein-A 
chromatography, ammonium sulfate precipitation, ion exchange chromatography, HPLC, and 
immunoaffinity chromatography using the immunizing polypeptide coupled to a solid 
support. Antibody fragments and other derivatives can be prepared by standard 
immunochemical methods, such as subjecting the antibody to cleavage with enzymes such as 
papain, pepsin, or trypsin. 

Any unwanted cross-reactivity can be removed by treating the polyclonal antibody 
mixture with adsorbents made of the cross-reacting antigens attached to a solid phase, and 
collecting the unbound fraction. Contaminating activity against other transferases of the same 
species, or against α1,3GT from other species, can all be removed by this technique if such 
cross-reactivity would interfere with the intended use of the antibody. Specificity of the original 
antisera can be improved to start with, by immunizing with peptide fragments of α1,3GT that 
are substantially distinct from the equivalent region of the homologous protein. 

Production of monoclonal antibodies is described in such standard references as 
Harlow & Lane (1988), U.S. Patent Nos. 4,491,632, 4,472,500 and 4,444,887, and Methods in 
Enzymology 73B:3 (1981). Briefly, a mammal is immunized as described above, and 
antibody-producing cells (usually splenocytes) are harvested. Cells are immortalized, for example, by 
fusion with a non-producing myeloma, transfecting with Epstein Barr Virus, or transforming 
with oncogenic DNA. The treated cells are cloned and cultured, and clones are selected 
that produce antibody of the desired specificity. 

Other methods of obtaining specific antibody molecules (optimally in the form of 
single-chain variable regions) involve contacting a library of immunocompetent cells or viral 
particles with the target antigen, and growing out positively selected clones. 
Immunocompetent phage can be constructed to express immunoglobulin variable region 
segments on their surface. See Marks et al., New Eng. J. Med. 335:730, 1996, International 
Patent Applications WO 94/13804, WO 92/01047, WO 90/02809, and McGuiness et al., 
Nature Biotechnol. 14:1449, 1996. 

The antibodies of this invention can be used in immunoassays to detect or 
quantitate any of the polypeptides of this invention, including α1,3GT. For example, it may 
be desirable to measure α1,3GT in a biological sample to determine whether an individual 
expresses α1,3GT (and hence the Galα(1,3)Gal epitope), or whether a cell has been 
successfully treated to inactivate the α1,3GT gene. It may also be desirable to measure 
α1,3GT in a biological sample taken from an individual suspected of having a disease that is 
correlated with altered expression of α1,3GT. 

General techniques of immunoassay can be found in "The Immunoassay Handbook", 
Stockton Press NY, 1994; and "Methods of Immunological Analysis", Weinheim: VCH 
Verlagsgesellschaft mbH, 1993. The antibody is combined with a test sample under 
conditions where the antibody will bind specifically to any α1,3GT that might be present, 
but not to any other proteins liable to be in the sample. The complex formed can be measured in 
situ (U.S. Patent Nos. 4,208,479 and 4,708,929), or by physically separating it from 
unreacted reagents (U.S. Patent No. 3,646,346). Separation assays typically involve labeled 
α1,3GT reagent (competition assay), or labeled antibody (sandwich assay) to facilitate 
detection and quantitation of the complex. Suitable labels are radioisotopes such as ¹²⁵I, 
enzymes such as β-galactosidase, and fluorescent labels such as fluorescein. Antibodies of 
this invention can also be used to detect α1,3GT in tissue sections by such techniques as 
immunohistology or flow cytometry. The antibody is contacted with the tissue, unreacted 
antibody is washed away, and then bound antibody is detected, either directly or by using 
a labeled antiimmunoglobulin reagent. 

Assays of this nature can also be used in a competitive format to identify antibodies 
that bind to the same epitope on a target compound. In one such format, the reference 
antibody is labeled, and tested for binding to α1,3GT or a related peptide under conditions 
that permit specific antibody-peptide complexes to form. In parallel, the α1,3GT peptide is 
first reacted with the test antibody, and the labeled reference antibody is added subsequently. 
The assay is read by separating free antibody from peptide-antibody complexes (e.g., by gel 
filtration or precipitation), and then determining the proportion of label associated with the 
complexes. If the bound proportion decreases as a result of preincubating with the test 
antibody, then the test antibody and the reference antibody share overlapping or 
conformationally related epitopes. Epitope sharing can also be evaluated by mapping the 
binding site of each antibody using peptide fragments, as described earlier. Antibodies can also 
be screened to identify those with catalytic inhibitory capacity, according to the inhibition 
screening assay described in the preceding section. 
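
By way of illustration only, the readout of the competitive epitope assay can be expressed 
as a short calculation. The function names and the 50% cutoff below are assumptions chosen 
for this sketch; the disclosure specifies only that a decrease in the bound proportion 
indicates overlapping or conformationally related epitopes. 

```python
def bound_fraction(bound_counts, total_counts):
    """Proportion of labeled reference antibody recovered in the complexes."""
    if total_counts <= 0:
        raise ValueError("total_counts must be positive")
    return bound_counts / total_counts


def percent_inhibition(bound_without_test, bound_with_test):
    """Percent decrease in bound label after preincubation with the test antibody."""
    if bound_without_test <= 0:
        raise ValueError("reference binding must be positive")
    return 100.0 * (bound_without_test - bound_with_test) / bound_without_test


def shares_epitope(bound_without_test, bound_with_test, threshold=50.0):
    """Call epitope sharing when inhibition meets the threshold.

    Assumption: the 50% default is an arbitrary illustrative cutoff.
    """
    return percent_inhibition(bound_without_test, bound_with_test) >= threshold


if __name__ == "__main__":
    # Hypothetical label counts, not experimental data.
    reference_only = bound_fraction(8000, 10000)
    with_test_antibody = bound_fraction(2000, 10000)
    print(percent_inhibition(reference_only, with_test_antibody))
    print(shares_epitope(reference_only, with_test_antibody))
```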

Inactivation of the α(1,3)Galactosyltransferase gene 

This invention provides cells and cell nuclei, in which a preexisting gene for α1,3GT 
has been inactivated. This means that synthesis of the Galα(1,3)Gal epitope by at least one 
α1,3GT allele has been suppressed. 

A gene for α1,3GT can be inactivated by any one of a number of transformations. For 
example, the gene may be adapted so that no mRNA transcript is produced, such as by 
deleting or altering a transcription control element, such as a promoter or a transcription start 
sequence. Alternatively, the gene may be adapted so that any mRNA that is produced is not 
translatable into the protein product. This can be effected, for example, by deleting or 
altering a translation control element, such as a ribosomal binding site or a translation initiation 
codon. Alternatively, the gene may be adapted so that any protein that is produced lacks the 
essential features of a glycosyl transferase. For example, the encoding region can be 
interrupted with stop codons, the encoding region can be placed out-of-phase, or critical 
portions of the protein may be missing, such as a structural component or a signal peptide for 
secretion. In a related alternative, the gene may be adapted so that the protein product lacks 
the specificity of α1,3GT, either because the catalytic site is removed, or because substrate 
binding specificity has been sufficiently altered so that the enzyme is incapable of 
synthesizing the Galα(1,3)Gal linkage. 

A principal method for inactivating the α1,3GT gene is to actually change the genetic 
structure, making the inactivation inheritable by progeny of the cell. Changes of this nature 
can be effected by disrupting the genome with an integrating vector or mutating the genome by 
chemical or biological treatment. Treated cells are then cloned, and tested to determine if 
α1,3GT expression has been affected. Suitable means of testing include determination of 
mRNA expression by probe or amplification based assays, or determination of α1,3GT 
protein expression by immunoassay of cell lysates or immunohistochemical staining of fixed 
cells. 

Efficient targeting of the α1,3GT gene generally requires a targeting vector, comprising 
nucleotide sequence identical or nearly identical to a portion of the gene of interest, linked to 
another structure capable of introducing the alteration. One such method uses homologous 
recombination, in which a DNA vector comprising homologous regions recombines at the 
targeted site, substituting its DNA sequence for that of the target. Cloned cells that have been 
selectively targeted can be identified by PCR amplification of a sequence exclusive to the 
targeting vector, restriction analysis of the recombination site, or expression phenotype. 

Generally it is more convenient to include a selectable marker in the targeting construct, 
so that targeted cells can rapidly be separated from untargeted cells. U.S. Patent 5,614,396 
describes a method for obtaining a cell containing a desired sequence in the cell's genome, by 
using a targeting vector having two regions homologous to the targeting sequence, flanking the 
sequence which is to be inserted, and also having a selectable marker. The DNA undergoes 
homologous recombination at the target site, and recombined cells are recovered under selective 
culture conditions. Positive selection markers include the neo gene, selectable using G418 or 
kanamycin; the hyg gene, selectable using hygromycin; the gpt gene, selectable using xanthine; 
and hypoxanthine-phosphoribosyltransferase (HPRT), selectable using hypoxanthine. 
Negative selection markers include thymidine kinase (TK), selectable using acyclovir or 
ganciclovir; HPRT, selectable using 6-thioguanine; and cytosine deaminase, selectable 
using 5-fluorocytosine. Markers can also be labels like green fluorescent protein or 
β-galactosidase, which permit clones of targeted cells to be identified and selected. In certain 
contexts, it is advantageous to use markers with reduced potential for antigenicity in humans. 
For example, U.S. Patent No. 6,020,192 illustrates a humanized green fluorescent protein. 

For effecting homologous recombination, U.S. Patent Nos. 5,464,764 and 5,631,153 
describe a double-selection strategy, in which two sequences homologous to the gene target 
flank a positive selection marker, and a negative selection marker is attached to the 3' terminus 
of the second flanking region. Homologous integration retains the positive selection marker, 
but eliminates the negative selection marker, whereas random integration usually retains both 
markers. Thus, by screening for both markers sequentially or together, cells that have been 
correctly targeted will be positively selected, and those that have been incorrectly targeted are 
selected out. U.S. Patent 5,789,215 reports the use of homologous recombination targeting 
vectors for modifying the cell genome of mouse embryonic stem cells. Other information of 
interest for homologous recombination targeting can be found in U.S. Patent Nos. 5,589,369 
and 5,776,774. 
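
By way of illustration only, the logic of the double-selection strategy can be reduced to a 
simple truth table: a clone survives if it carries the positive selection marker and has lost 
the negative selection marker. The class name Clone, the function name, and the three example 
clones are hypothetical and serve only to show the reasoning. 

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Clone:
    name: str
    has_positive_marker: bool  # e.g., neo, retained by any integration event
    has_negative_marker: bool  # e.g., TK, usually lost on homologous integration


def survives_double_selection(clone):
    """True if the clone grows under both positive and negative selection."""
    return clone.has_positive_marker and not clone.has_negative_marker


# Hypothetical outcomes of a targeting experiment.
clones = [
    Clone("homologous integration", True, False),
    Clone("random integration", True, True),
    Clone("no integration", False, False),
]

selected = [c.name for c in clones if survives_double_selection(c)]

if __name__ == "__main__":
    print(selected)
```

The sketch ignores the minority of random integrants that lose the negative marker, which is 
why candidate clones are still confirmed by PCR or Southern analysis. 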

Example 3 describes illustrative targeting vectors according to this invention. They are 
capable of inactivating the sheep α1,3GT gene by eliminating part of the protein coding region 
via homologous recombination. The vectors comprise flanking regions identical to the targeted 
α1,3GT sequence, one side being about 1-kb, the other being at least 1 or 2-kb, in either order. 
In between the flanking regions is a selectable marker such as neo, designed to replace one of 
the exons in the α1,3GT coding sequence. The selectable marker genes are not provided with 
their own promoter, and require continued translation through the upstream α1,3GT sequence 
in order to be expressed. This helps the marker select for properly integrated vector, because 
vector inserted at a random site will probably not link the marker gene to a suitable promoter, 
and resistance to the selector drug will not be conferred. This strategy is particularly effective 
for cell types that normally express a high level of α1,3GT and the Galα(1,3)Gal epitope. In 
these cells, the α1,3GT promoter will be highly active and the drug resistance marker will be 
strongly expressed. Thus, a higher concentration of selector drug can be used to select out 
cells that have the vector inserted elsewhere. 

Vectors p0054, p0079, and p0063 (Figures 9, 10, and 11) are targeted to eliminate 
Exon 4, which contains the α1,3GT translation start codon. Vector p0078 (Figure 12) is 
targeted to eliminate Exon 8, and contains an additional feature of interest. The sequence 
replacing Exon 8 has an encoding region for a poly-glycine linker, followed by the neo gene. 
Since neo is translated within the sequence of the rest of the α1,3GT protein, the linker should 
improve the flexibility or accessibility of the neo gene product and enhance the neomycin 
resistance conferred. Vectors p0047 and p0046 are designed to replace Exon 9, which is 
thought to contain at least part of the α1,3GT catalytic site. 

All these vectors, and other designs that would be effective in inactivating ovine 
α1,3GT, can be constructed using the sequence data provided in this disclosure and optionally 
the deposited phage plasmids, according to standard protocols of genetic engineering. Also 
suitable are insertion vectors (the general structure of which is reported, for example, by 
Hasty et al., Mol. Cell Biol. 11:4509, 1991). In this case, homologous recombination results 
in insertion of the linearized vector sequence, thereby disrupting the gene. Also suitable are 
shorter oligonucleotides, such as chimeraplasts (A. Cole-Strauss et al., Science 273:1386, 
1996), which can direct single base pair changes to a specific gene sequence. 

The targeting vectors can be used to inactivate α1,3GT in sheep cells, or the cells of 
other phylogenetically related species that have sufficient identity through the targeted area 
that homologous recombination will take place. Of particular interest are ruminants of the 
subfamily Caprinae, especially Ovis and Pseudois, exemplified by the sheep of any domestic 
breed. 

Example 4 provides a non-limiting illustration of the use of such vectors. A suitable 
cell line is combined with the vectors in a culture medium, and the vectors are introduced into 
the cell. In the illustration, the vectors are introduced by optimized conditions of 
electroporation. The cells are then cultured for a time in an appropriate medium for 
maintenance of the cells, during which time the recombination event should occur. The cells 
are then subjected to culture conditions that permit outgrowth of cells bearing the selectable 
marker as a result of successful recombination. 

Those skilled in the art will recognize that other means of introducing the targeting 
vector into the cell are also acceptable. For example, International Patent Application 
WO 29/48005 reports a strategy for modifying a gene in a cell by introducing a parvoviral 
vector comprising a sequence substantially identical to the target except for the modification 
being introduced, and permitting the viral vector to effect homologous recombination, thereby 
modifying the gene sequence. Figure 12 shows targeting constructs based on vectors p0054 
and p0079, modified to ~4-kb, so that they will fit within the genome of adeno-associated 
virus. U.S. Patent No. 5,932,241 describes cationic lipid DNA complexes for gene targeting. 
U.S. Patent No. 5,804,413 describes herpes simplex virus strains for gene transfer. 

After genetic manipulation has been completed and altered cells have been selected, 
inactivation of the α1,3GT gene can be confirmed by suitable testing. Assays for expression 
of α1,3GT at the mRNA level and at the protein level have already been described. The 
nature of the genetic alteration can be determined by PCR amplification using primers 
bracketing the targeted recombination site, and characterizing the amplification product, or by 
Southern analysis. If the targeting vector contains a unique sequence, then correct integration 
can be confirmed using a primer specific for the inserted sequence and a primer specific for 
neighboring α1,3GT sequence. Sequence information disclosed in this application relating to 
α1,3GT noncoding and intron regions (SEQ. ID NOs:15-25) can be used to develop such 
primers. As illustrated in Example 4, production of amplification product of the predicted 
size confirms correct integration. 

Animals and cells that are homozygous for inactivated α(1,3)Galactosyltransferase 

Certain aspects of this invention relate to animals and tissues in which the α1,3GT 
gene of at least one haplotype is inactivated. To produce such animals, a cell with this 
characteristic is provided that has the potential to develop into a fertile embryo. In mice, it is 
possible to inactivate α1,3GT in embryonic stem cells, inject the cells into a blastocyst in 
culture, and then engraft the blastocyst into a female which acts as a surrogate to bring the 
genetically altered embryo to term (U.S. Patent No. 5,849,991). 

In ungulate animals, a different procedure has been developed. First, an α1,3GT gene 
is inactivated in cultured cells, and then the nucleus of such donor cells is transferred into an 
enucleated recipient cell, such as an oocyte or other pluripotent cell, that is capable of 
developing into a fertile embryo after the nucleus has been transferred. 

International Patent Application WO 97/07669 (Roslin Institute) describes quiescent 
cell populations for nuclear transfer. International Patent Application WO 97/07668 (Roslin 
Institute) describes inactivated oocytes as cytoplast recipients for nuclear transfer. 
International Patent Application WO 99/01164 (University of Massachusetts) relates to 
cloning pigs using donor nuclei from differentiated cells. U.S. Patent No. 5,994,619 
(University of Massachusetts) reports production of chimeric bovine or porcine animals using 
cultured inner cell mass cells. U.S. Patent No. 6,011,197 (Infigen) relates to a method for 
cloning bovines, by reprogramming non-embryonic bovine cells using leukemia inhibitory 
factor (LIF) and fibroblast growth factor (FGF), then transferring the nucleus into an 
enucleated oocyte. International Patent Application WO 99/21415 (Stem Cell Sciences) 
reports nuclear transfer for production of transgenic animal embryos. Loi et al. (Reprod. Nutr. 
Dev. 38:615, 1998) discuss embryo transfer and related technologies in sheep reproduction. 
Wells et al. (Biol. Reprod. 57:385, 1997) report production of cloned lambs from an 
established embryonic cell line. Liu et al. (Mol. Reprod. Dev. 47:255, 1997) discuss the effect 
of cell cycle coordination between nucleus and cytoplasm and the use of in vitro matured 
oocytes in nuclear transfer in sheep embryos. Campbell et al. (Nature 380:65, 1996) report 
sheep cloned by nuclear transfer from an established cell line. 

In the practice of this invention, donor cells are typically nucleated cells of the desired 
species that can be maintained easily in culture conditions. An illustrative example is a 
primary fibroblast line, although other types of cells (such as a stable endothelial cell culture) 
may also be suitable. For the production of α1,3GT inactivated sheep, suitable donor cells 
can be taken from fetal tissue of the Black Welsh Mountain or Finn Dorset line. The selected 
breed need not be isogenic for the particular vector constructs illustrated in this disclosure, 
because there will be sufficient identity within the α1,3GT gene to be successfully targeted. 
Primary fibroblast cultures can readily be produced from fetuses collected at about 35 days of 
gestation, and subjected to a mild trypsin/EDTA solution. The targeting vector is then 
introduced into the cells, which are cultured long enough to permit genetic alteration to take 
place, and then selected for proper recombination, as described in the previous section. 

Nuclear transfer is particularly effective if the nucleus of the donor cell is quiescent, 
which can be achieved by culturing the donor cell in a serum-free medium (WO 97/07669). In 
an exemplary method, the nucleus of a donor cell is transferred into an oocyte that is arrested 
in the metaphase of the second meiotic division, and the reconstituted cell is subsequently 
activated. Briefly, unfertilized metaphase II oocytes are collected as follows: Female animals are 
synchronized using progestagen sponges for ~14 days, and induced to superovulate with 
single injections of follicle-stimulating hormone on two successive days. Ovulation is induced 
with a suitable dose of gonadotrophin-releasing hormone or an analog thereof (e.g., ~8 mg 
GnRH Receptal™, Hoechst, UK) on the following day. The oocytes are recovered by 
flushing from the oviduct one day later, washed, and enucleated by treating with cytochalasin 
B and aspirating the nucleus using a glass pipette. Enucleated oocytes are then placed into 
contact with a single cell that acts as the nucleus donor. 

Fusion of the donor nucleus into the enucleated recipient cell is effected by placing the 
couplet in a fusion chamber and aligning it between the electrodes. Electrical pulses are then 
applied to induce fusion, typically a low-voltage AC pulse for several seconds, followed by a 
plurality of very short high-voltage DC pulses. Following an incubation period, activation is 
induced by application of an additional electrical pulse. The reconstructed zygote is then 
cultured for a time before engrafting into a surrogate female. Further details and alternative 
procedures are described in the patent publications cited above. 

Estrus in the surrogate female is typically synchronized artificially using a suitable 
combination of inducing agents. Cameron et al. (Aust Vet. J. 66:314, 1989) discuss 
synchronization methods and other practical aspects for commercial embryo transfer in pigs. 
Blum-Reckow et al. (J. Anim. Sci. 69:3335, 1991) report experiments relating to transfer of 
pig embryos after long-term in vitro culture. Replacing medium every 12 h during culture 
improved survival, and pregnancy rate improved if the sexual cycle of recipients was 24 h 
behind that of the donor. 

The embryos are introduced into the uterus of the recipient female using any suitable 
technique, including devices adapted for the purpose, or appropriate surgical methods. For 
example, U.S. Patent No. 4,326,505 describes surgical procedures for embryo transplants in 
animals, in which the uterine horn is positioned in the peritoneal cavity proximate to the 
vaginal wall, a cannula is inserted through the vaginal wall and into the uterine horn, and the 
embryo is introduced through the cannula. Non-surgical methods include using a suitable 
device to manipulate the injection port through the folds of the cervix to the bifurcation of the 
uterus. For example, devices and techniques for porcine non-surgical embryo transfer are 
reported by Li et al. (J. Anim. Sci. 74:2263, 1996). Wallenhorst et al. (J. Anim. Sci. 77:2327, 
1999) describe the effect of transferring pig embryos to different uterine sites. 

The expression of the Galα(1,3)Gal epitope is autosomal dominant, and prevention of 
Galα(1,3)Gal mediated xenograft rejection is improved with homozygous inactivation of 
α1,3GT. One option is to select cells for production of embryos that are already 
homozygously inactivated. For example, the α1,3GT gene can be targeted for homologous 
recombination with a construct that incorporates a drug resistance gene, such as neo. Targeted 
cells are first cultured with neomycin at a level that will select for recombination of at least one 
haplotype. Selected cells are then cultured at a higher level of neomycin to select double 
recombinants (U.S. Patent No. 5,589,369). In another example, the α1,3GT gene is targeted 
simultaneously or sequentially with two different constructs for the same region, each having a 
different drug resistance gene. The Exon 4 targeting vectors p0054 (neo) (Figure 9) and p0063 
(pad) (Figure 11) can be used in this capacity. Cells are then selected by culturing with the 
two selection drugs simultaneously or sequentially to select double recombinants. In yet 
another example, targeting vectors are used that delete part of the α1,3GT gene sequence, 
targeted cells are cloned, and their genomic DNA is subjected to Southern or PCR analysis to 
determine whether the sequence has been deleted from both haplotypes. In a further example, 
the vectors are used to target cells that actively synthesize and present on the plasma 
membrane a high density of Galα(1,3)Gal (such as endothelial cells). After targeting, the cells 
are subjected to multiple rounds of proliferation and selection using specific antibody to the 
determinant, for example, by flow cytometry or complement-mediated lysis. The cell 
population surviving repeated selection will be incapable of synthesizing the xenogeneic 
oligosaccharide, and will be enriched for cells having homozygous inactivation of α1,3GT. 

Animals homozygous for inactivated α1,3GT can also be obtained by first producing 
animals heterozygous for inactivated α1,3GT according to this disclosure, and then mating 
the heterozygous knockouts once they reach sexual maturity, to obtain progeny that have 
inherited inactivated α1,3GT from both parents. Alternatively, cells from a heterozygous 
knockout can be targeted with an inactivation vector according to this invention to inactivate 
α1,3GT on the other haplotype, and used to produce an embryo with homozygously 
inactivated α1,3GT. 
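
By way of illustration only, the expected outcome of mating two heterozygous knockouts can 
be enumerated as follows. The sketch assumes simple Mendelian segregation of a single 
autosomal locus with no effect of genotype on viability; the genotype notation ('+' for a 
functional allele, '-' for an inactivated allele) and the function name cross are 
assumptions made for this example. 

```python
from collections import Counter
from fractions import Fraction
from itertools import product


def cross(parent_a, parent_b):
    """Expected offspring genotype frequencies for a single autosomal locus.

    Each parent is a two-character string of alleles, where '+' is a
    functional allele and '-' is an inactivated allele.
    """
    # Each offspring receives one allele from each parent with equal probability.
    counts = Counter("".join(sorted(pair)) for pair in product(parent_a, parent_b))
    total = sum(counts.values())
    return {genotype: Fraction(n, total) for genotype, n in counts.items()}


if __name__ == "__main__":
    for genotype, frequency in cross("+-", "+-").items():
        print(genotype, frequency)
```

Under these assumptions, one quarter of the progeny of two heterozygous parents are expected 
to be homozygous for the inactivated gene. 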

Tissue devoid of Galα(1,3)Gal determinants 

Tissue samples can be tested to determine whether α1,3GT has been successfully 
inactivated. As described elsewhere in this disclosure, changes to the α1,3GT gene can be 
determined by Southern analysis or PCR amplification of genomic DNA; mRNA expression 
can be determined by Northern analysis or reverse transcriptase PCR; and protein expression 
can be determined by immunoassay or immunohistochemistry. Elimination of mRNA or protein 
expression correlates with homozygous inactivation of α1,3GT, whereas quantitative 
reduction in mRNA or protein expression correlates with inactivation of α1,3GT on one 
haplotype. If inactivation of α1,3GT has been effected at the level of protein catalytic 
activity, then mRNA and protein levels may appear normal. 

A useful test for successful α1,3GT inactivation is expression of the Galα(1,3)Gal 
epitope. This can be determined using an antibody or lectin specific for Galα(1,3)Gal. Since 
it is a self-antigen in most mammals, specific antibodies cannot easily be obtained by 
immunizing the usual source animals. However, purified antibody can be obtained from 
pooled human serum by adsorbing on an affinity column of Synsorb 115 (Chembiomed, 
Alberta, Canada) or D(+) melibiose (Sigma). An alternative is the "IB4" lectin from 
Bandeiraea (Griffonia) simplicifolia (Sigma Cat. L 3019) which is specific for α-D-galactosyl 
residues (Hayes et al., J Biol Chem. 25:1904, 1976), and binds both the Galα(1,3)Gal epitope 
and B blood group substance. In most mammals, Galα(1,3)Gal is expressed on red blood 
cells, and can be detected in a simple agglutination reaction. The antibody or lectin can be used 
to stain for Galα(1,3)Gal in fixed tissue or in flow cytometry analysis. 

Density of Galα(1,3)Gal in a particular tissue of interest can be quantitated using a 
complement lysis assay. Briefly, tissue cells are suspended (by treating with trypsin or 
collagenase, if necessary), washed, incubated in a medium containing ⁵¹Cr for a few hours to 
incorporate the label, and then washed again. The labeled targets are then combined with 
diluted human serum as a source of both antibody and complement, and then incubated for 
several hours at 37°C. Release of the label correlates with density of Galα(1,3)Gal on the 
surface of the target cells. For further details of assays for α1,3GT inactivation and 
Galα(1,3)Gal determination, the reader can consult U.S. Patent No. 5,849,991. 
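
By way of illustration only, label release in a complement lysis assay is commonly expressed 
as percent specific lysis. The spontaneous-release and maximum-release controls used below 
are a conventional way of normalizing such data and are an assumption of this sketch; they 
are not recited in the procedure above, and the function name and example counts are 
hypothetical. 

```python
def percent_specific_lysis(experimental, spontaneous, maximum):
    """Normalize released label against spontaneous and maximum release controls.

    experimental: label released from targets incubated with human serum
    spontaneous:  label released from targets incubated without serum
    maximum:      label released after complete lysis (e.g., with detergent)
    """
    if maximum <= spontaneous:
        raise ValueError("maximum release must exceed spontaneous release")
    return 100.0 * (experimental - spontaneous) / (maximum - spontaneous)


if __name__ == "__main__":
    # Hypothetical counts per minute, not experimental data.
    print(percent_specific_lysis(experimental=2600, spontaneous=600, maximum=4600))
```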
Once an animal has been produced with homozygously inactivated α1,3GT, there 
should be no synthesis of Galα(1,3)Gal by the α1,3GT gene, and the phenotype should breed 
true. The animal or any progeny bearing the same phenotype can be used as a source of any 
tissue type that is desired for xenotransplantation. Possible harvest tissue includes but is not 
limited to any of the following: whole organs, such as kidney, liver, heart, lung, eyes, and 
pancreas; solid tissue, such as skin, cartilage, pancreatic islets, and vasculature of various 
types; and cell suspensions, such as progenitor cells for regeneration of neural tissue, 
hepatocytes, or other cell types. 

The following examples are provided as further non-limiting illustrations of particular 
embodiments of the invention. 

EXAMPLES 

Example 1: Cloning and characterization of the ovine α(1,3)galactosyl transferase cDNA 

The cDNA encoding ovine α(1,3)-galactosyl transferase (GT) was cloned using the 
reverse transcriptase polymerase chain reaction (PCR) amplification technique. 

Two oligonucleotide primers were prepared. The sense oligonucleotide, 698007 
(5'-GAGAAAATAA TGAATGTCAA AGGA-3'; SEQ. ID NO:26), included 5' 
untranslated sequence and the first five codons that encode mature α1,3GT. The antisense 
oligonucleotide, 698006 (5'-TGATAATCCC AGCAGTATTC-3'; SEQ. ID NO:27), encoded 
about amino acids 336 to 342. These primers were used in combination with a Titan RT-PCR 
kit (Boehringer Mannheim) and total RNA (TriPure™ RNA extraction kit, Boehringer 
Mannheim) from Black Welsh Mountain (BW) fetal fibroblasts. 

The 1035bp PCR product was cloned into pCR-BluntII-TOPO vector (Invitrogen) 
and the resulting plasmid propagated by transforming TOP10 competent cells. One colony 
with plasmid containing the correct fragment was selected. Plasmid p0010 was isolated and 
the partial cDNA insert sequenced. The final 75 nucleotides of the coding sequence were 
obtained from phage G (see Example 3), giving a complete sequence of 1110 base pairs. 

The 5' untranslated region (UTR) of the ovine α1,3GT gene was cloned by rapid
amplification of cDNA ends (RACE) PCR. A 5' RACE kit (Boehringer Mannheim) was used in
accordance with the manufacturer's instructions. Total RNA was isolated from Black Welsh
Mountain fetal fibroblasts and a reverse transcriptase reaction carried out. An anchor of
known sequence was linked to the 5' termini of the cDNA molecules produced. First round
PCR was performed with sense (supplied) and antisense 798006 (5'-CTTGATGGGT
TTATCCAGAA CA-3'; SEQ. ID NO:28) primers. Second round PCR product, produced
with the same sense primer but a nested antisense primer (798005, 5'-CTGTGGATAT
ATTCCCAAAA CAC-3'; SEQ. ID NO:29), was cloned into pCR-BluntII-TOPO vector.
Colonies that contained plasmid with insert were selected. One clone was sequenced and
extended the ovine sequence by 193bp into the 5' UTR (SEQ. ID NO:14).
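
The primer sequences above can be checked against the cDNA. The following is a minimal sketch, not part of the original protocol: it reverse-complements the two antisense RACE primers and searches for them, together with sense primer 698007, in nucleotides 181 to 325 of SEQ. ID NO:1. The segment is transcribed from the Sequence Listing of this disclosure on the assumption that the transcription is error-free, and the function and variable names are illustrative only.

```python
# Segment of SEQ. ID NO:1 (nucleotides 181-325), transcribed from the
# Sequence Listing. Assumption: the transcription contains no errors.
SEGMENT_START = 181
SEGMENT = (
    "tgaggagaaaata"                                      # 181-193, 5' UTR
    "atgaatgtcaaaggaaaagtgattctgtcaatgctg"               # 194-229
    "gttgtctcaactgtcattgttgtgttttgggaatatatccacagccca"   # 230-277
    "gaaggctctttgttctggataaacccatcaagaaacccagaagtcagt"   # 278-325
).upper()

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def locate(primer: str, antisense: bool = False):
    """Return 1-based (first, last) coordinates of the primer site, or None."""
    target = reverse_complement(primer) if antisense else primer.upper()
    index = SEGMENT.find(target)
    if index < 0:
        return None
    first = SEGMENT_START + index
    return first, first + len(target) - 1


# Primer name -> (sequence as written 5' to 3', is it an antisense primer?)
PRIMERS = {
    "698007": ("GAGAAAATAATGAATGTCAAAGGA", False),
    "798006": ("CTTGATGGGTTTATCCAGAACA", True),
    "798005": ("CTGTGGATATATTCCCAAAACAC", True),
}

if __name__ == "__main__":
    for name, (sequence, antisense) in PRIMERS.items():
        print(name, locate(sequence, antisense))
```

Under these assumptions the nested primer 798005 maps upstream of 798006, as expected for a nested second-round antisense primer.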

Figure 1 shows the nucleotide sequence of the cDNA (SEQ. ID NO:1). There are
193bp at the 5' end that are untranslated. Also shown is the 369 amino acid sequence of the
predicted protein product (SEQ. ID NO:2). The encoding region is residues 194 to 1300 of
SEQ. ID NO:1.
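
The coordinates quoted for SEQ. ID NO:1 can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive numbering, and assumes that the CDS feature given in the Sequence Listing (194 to 1303) includes the stop codon while the range 194 to 1300 does not. The helper name is illustrative.

```python
def span_length(first: int, last: int) -> int:
    """Length of a 1-based, inclusive coordinate range."""
    return last - first + 1


# Coordinates on SEQ. ID NO:1 as quoted in the text and Sequence Listing.
UTR_5_PRIME = span_length(1, 193)        # untranslated leader
CODING = span_length(194, 1300)          # codons for the predicted protein
CDS_WITH_STOP = span_length(194, 1303)   # assumption: feature includes the stop codon

codons, leftover = divmod(CODING, 3)

if __name__ == "__main__":
    print(f"5' UTR: {UTR_5_PRIME} nt")
    print(f"coding: {CODING} nt = {codons} codons (leftover {leftover})")
    print(f"CDS incl. stop: {CDS_WITH_STOP} nt = {CDS_WITH_STOP // 3} codons")
```

The 1107 coding nucleotides divide evenly into 369 codons, matching the 369 amino acid product, and the 1110-nucleotide range adds one further codon for the stop.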

Figures 2 and 3 compare the polynucleotide and amino acid sequences with those of
the bovine α1,3GT (SEQ. ID NOs:3 and 4). Identical residues are marked below with an
asterisk.

The sheep α1,3GT encoding sequence and translation product were compared with
SEQ. ID NOs:3-11 using version 2.0 of the BLASTN, BLASTP, and TBLASTN algorithms
(National Center for Biotechnology Information; Altschul et al., Nucleic Acids Res. 25:3389,
1997). Comparison of ovine α1,3GT protein encoding sequence across the whole length of
the murine sequence (Joziasse et al., 1992. J. Biol. Chem. 267:5534-5541) and bovine sequence
(Joziasse et al., 1989. J. Biol. Chem. 264:14290-14297) showed homologies of 78% and 95%,
respectively. A more detailed comparison is shown in Table 1.

TABLE 1: Comparison of Sheep α1,3GT with other α1,3GT sequences

Comparison with              SEQ. ID NO       Sheep α1,3GT         Identity in
                                              residues matched     matched sequence

NUCLEOTIDES
bovine α1,3GT                SEQ. ID NO:3     194-1303             95 %
marmoset α1,3GT              SEQ. ID NO:5     448-1303             87 %
pig α1,3GT                   SEQ. ID NO:7     503-1303             90 %
mouse α1,3GT                 SEQ. ID NO:9     657-1303             83 %
human α1,3GT pseudogene      SEQ. ID NO:11    620-1303             87 %

AMINO ACIDS
bovine α1,3GT                SEQ. ID NO:4     1-369                94 %
marmoset α1,3GT              SEQ. ID NO:6     1-369                82 %
pig α1,3GT                   SEQ. ID NO:8     1-369                82 %
mouse α1,3GT                 SEQ. ID NO:10    1-369                72 %
human α1,3GT pseudogene      SEQ. ID NO:12    139-249              81 %


Example 2: Expression and Southern blot analysis of the Galactosyl Transferase gene 
Based on the comparison of the ovine cDNA sequence with the genome and mouse
sequence, predictions were made about the organization of the ovine α1,3GT gene.
Hybridization probes for the α1,3GT gene outside the coding region were designed as follows:
a partial cDNA probe, produced by excising a 1035bp fragment from plasmid p0010
(Example 1); an Exon 4 probe, produced by concatamerizing 95bp synthetic oligonucleotides
(sense 998003, antisense 998004; SEQ. ID NOs:30 and 31); a 700bp fragment spanning from
Exon 6 to 7, derived by PCR from phage G (Example 3) using primers 898003
(5'-GGTGGTTTCC GAGATGGTTT AACA-3'; SEQ. ID NO:32) and 898005
(5'-GGGTTGAACC AGTCCGATAG CTTA-3'; SEQ. ID NO:33); and a 513bp fragment from
Exon 9, produced by PCR using primers 698001 (5'-TCCAGGATGC CTTTGATAGA
G-3'; SEQ. ID NO:34) and 698006 (5'-TGATAATCCC AGCAGTATTC-3'; SEQ. ID
NO:27). These probes were used for the analysis in this example, and for library screening
(Example 3).
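
The stated sizes of the partial cDNA probe (1035bp) and the Exon 9 probe (513bp) can be reproduced from primer coordinates on SEQ. ID NO:1. This is a sketch under stated assumptions rather than a description of how the sizes were originally determined: sense primers 698007 and 698001 are taken to begin at nucleotides 185 and 707 (positions read from the Sequence Listing), and antisense primer 698006 is assumed to end at the last base of codon 342, which is inferred from the statement that it covers about amino acids 336 to 342.

```python
CDS_START = 194  # first base of the start codon in SEQ. ID NO:1


def codon_last_base(residue: int) -> int:
    """1-based coordinate of the third base of the codon for `residue`."""
    return CDS_START + 3 * residue - 1


def amplicon_length(sense_first: int, antisense_last: int) -> int:
    """Length of a PCR product spanning the two outermost primer bases."""
    return antisense_last - sense_first + 1


# Assumption: the 698006 site ends at the last base of codon 342.
ANTISENSE_698006_LAST = codon_last_base(342)

# Assumed start positions of the sense primers on SEQ. ID NO:1.
SENSE_698007_FIRST = 185   # partial cDNA product
SENSE_698001_FIRST = 707   # Exon 9 probe

partial_cdna = amplicon_length(SENSE_698007_FIRST, ANTISENSE_698006_LAST)
exon9_probe = amplicon_length(SENSE_698001_FIRST, ANTISENSE_698006_LAST)

if __name__ == "__main__":
    print("partial cDNA product:", partial_cdna, "bp")
    print("Exon 9 probe:", exon9_probe, "bp")
```

Both stated fragment sizes are recovered with the same assumed 3' limit, which supports the primer placement used here.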

Figure 4 shows Southern blot analysis to determine the copy number of α1,3GT in
sheep. Ovine genomic DNA was digested with ApaI, AseI, BamHI, BsmI, EcoRI or EcoRV,
then transferred to nitrocellulose membrane, using standard techniques (Sambrook et al., 1989).
Probing with Exon 4 or Exon 6-7 probes revealed a single band in each lane. The likelihood of
achieving disruption of the ovine α1,3GT gene using the targeting strategies in Example 3 is
greater if the gene is transcribed and exists as a single copy in the genome.

Figure 5 shows Northern blot analysis for gene expression. Northern analysis was
performed using 8 µg of Black Welsh Mountain (BW) fetal fibroblast-derived poly-A RNA,
which was isolated using standard techniques (Sambrook et al., 1989). Probing with the
α1,3GT partial cDNA fragment revealed a single band of approximately 3-kb, similar to that
previously reported in the mouse (Joziasse et al., 1992. J. Biol. Chem. 267:5534-5541).

Together, these data provide evidence that the α1,3GT gene is single copy and is
expressed in Black Welsh Mountain fetal fibroblasts.



Example 3: Construction of targeting vectors from genomic Galactosyl Transferase DNA 
Construction of vectors to disrupt the α1,3GT gene by homologous recombination
requires isolation of genomic sequences, isogenic (or at least sufficiently identical) to
sequences in the target cells. To facilitate this, genomic DNA was isolated from Black Welsh
Mountain fetal fibroblasts and a λDASH II phage library constructed. Briefly, Sau3A
partially digested genomic DNA was dephosphorylated and ligated to compatible BamHI
vector arms (Stratagene). The ligation products were packaged to produce recombinant phage,
which were propagated on spi selective XL1-Blue-MRA(P2) bacterial cells. The resulting
library had a complexity of 1.4 x 10^6 recombinants and was subsequently amplified once.

Using the α1,3GT Exon-4, Exon-6-7 and Exon-9 probes described in Example 2, phage
clones spanning these regions were isolated according to standard methods (Sambrook et al.,
1989). Six useful clones were isolated.

Figure 6 shows the restriction map of the α1,3GT gene, and the regions included in
phage α1,3GT clones. The clones designated B and D contain the 5' UTR of the α1,3GT
gene and Exon 4, which contains the start codon. The clones designated C and N contain
Exons 4, 5, 6 and 7. The clone G spans Exons 6, 7, 8 and 9. Clone Q contains part of Exon 9
and the 3' UTR. These phage DNAs reflect the endogenous α1,3GT gene. They were
characterized by inter-exon PCR, sequencing and restriction analysis combined with
exon-specific probing.

Recombinant phage B, C and G have been deposited as a pooled sample, which can be
separated using the oligonucleotide probes 1198001 (5'-GGGAGGAAGC
GAAGGTGCA-3'; SEQ. ID NO:35), 798006 (5'-CTTGATGGGT TTATCCAGAA CA-3';
SEQ. ID NO:28) and 698006 (5'-TGATAATCCC AGCAGTATTC-3'; SEQ. ID NO:27),
that recognize 5'UTR, Exon 5 and Exon 9 sequence, respectively.

Figure 7 maps the sequenced intron regions of α1,3GT to their positions in the gene.
The areas numbered 1 to 9 in the figure correspond to SEQ. ID NOs:15, 16, 19, 20, 21, 22, 23,
24, and 25, respectively.

Figure 8 shows the position of exemplary targeting vectors designed for use in 
homologous recombination to disrupt gene expression by excising Exons 4 and 9. These and a 
number of other vectors are described below. 

Figure 9 shows in more detail the promoterless targeting vector designated plasmid
p0054, directed towards Exon 4. The vector comprises two regions that are complementary
to genomic sequence: a 1.2-kb 5' arm, which includes sequence from Intron 3 leading up to and
including the start codon of the α1,3GT gene in Exon 4, and a ~9-kb 3' arm that initiates
1.5-kb into Intron 4, continuing to Intron 5. Separating these regions is neoR-polyA sequence.
After homologous recombination the vector confers neomycin phosphotransferase-mediated
resistance to the cells and deletes 1.5-kb of genomic sequence, including the first coding exon
of the α1,3GT gene. The entire cassette was cloned into pBlueScript™ for propagation in
DH5α bacterial cells.

The p0054 vector was constructed by amplifying a truncated left arm (300bp, includes
EcoRI site) (using primers 199001, 5'-ACGTGGCTCC AAGAATTCTC CAGGCAAGAG
TACTGG-3', SEQ. ID NO:36; and 199006, 5'-CATCTTGTTC AATGGCCGAT
CCCATTATTT TCTCCTGGGA AAAGAAAAG-3', with tail complementary to the start
of neo coding sequence, SEQ. ID NO:37), and a neo-polyA sequence obtained from
Stratagene (using primers 199005, 5'-CTTTTCTTTT CCCAGGAGAA AATAATGGGA
TCGGCCATTG AACAAGATG-3', SEQ. ID NO:38, with tail complementary to left arm;
and 199004, 5'-CAGGTCGACG GATCCGAACA AAC-3', SEQ. ID NO:39). These
fragments were used to prime from each other to give a 1.2-kb fusion product. This was
ligated to Intron 3 sequence, to extend the left arm, and to ~9-kb of 3' sequence to create the
right arm, which initiates 1.5-kb into Intron 4, continuing to Intron 5.
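
The two internal primers, 199006 and 199005, carry complementary tails, which is what allows the left-arm and neo-polyA products to prime from each other. A minimal sketch of that check follows; it uses only the two primer sequences given above, and the function and variable names are illustrative.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


# Internal fusion primers for p0054 (SEQ. ID NOs:37 and 38), spaces removed.
PRIMER_199006 = "CATCTTGTTCAATGGCCGATCCCATTATTTTCTCCTGGGAAAAGAAAAG"
PRIMER_199005 = "CTTTTCTTTTCCCAGGAGAAAATAATGGGATCGGCCATTGAACAAGATG"

# If the primers are exact reverse complements, the two PCR products share
# an overlap equal to the full primer length.
fully_complementary = reverse_complement(PRIMER_199005) == PRIMER_199006

# Split the sense primer at the first ATG: left-arm part, then neo part.
junction = PRIMER_199005.find("ATG")
left_arm_part = PRIMER_199005[:junction]
neo_part = PRIMER_199005[junction:]

if __name__ == "__main__":
    print("fully complementary:", fully_complementary)
    print("left-arm part:", left_arm_part, len(left_arm_part), "nt")
    print("neo part:     ", neo_part, len(neo_part), "nt")
```

The split at the first ATG places the junction exactly at the start codon, consistent with a left arm that runs up to and including the start codon of the α1,3GT gene.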

Figure 10 shows the promoterless neo-polyA insertion vector designated plasmid
p0079, also directed towards Exon 4. This vector contains the same left arm-neo-polyA
fusion as in vector p0054, but with a modified right arm of 3.9-kb. The 3' region comprises a
1.5-kb fragment, generated by PCR (using primers 200011, 5'-CAGATCTAAC
GAGGATTCAA TGTCAAAGGA AAAGTGATTC TGTCAAT-3', SEQ. ID NO:40; and
499006, 5'-CTGAACTGAA TGTTTATCCA GGCCATC-3', SEQ. ID NO:41), which
extends from the second codon in Exon 4 into Intron 4, replacing the sequence deleted in
p0054. The 3' arm was extended by ligation to a 2.4-kb EcoRV downstream fragment.

Figure 11 shows the promoterless pac-polyA replacement vector designated plasmid
p0063, also directed towards Exon 4. Construction of this vector was similar to that of p0054,
except that it contains the pac gene, which codes for puromycin N-acetyltransferase, rather
than the neo gene. The pac sequence is available in plasmid pPUR from Clontech. The
oligonucleotide primers used to generate the 5'-pac-polyA fusion were, for the 5' region,
199001 (SEQ. ID NO:36) and 699002 (5'-GCGCACCGTG GGCTTGTACT
CGGTCATTAT TTTCTCCTGG GAAAAGAAAA G-3', SEQ. ID NO:42), with tail
complementary to the start of pac coding sequence; and, for pac-polyA, 699004
(5'-GAGAAAATAA TGACCGAGTA CAAGCCCACG GTGC-3', SEQ. ID NO:43), with
tail complementary to left arm, and 699005 (5'-CTGGGGATCC AGACATGATA
AGATAC-3', SEQ. ID NO:44).

Figure 12 shows restriction fragments of the Exon 4 directed targeting vectors (3.9-kb
NsiI / BglII, from p0054; 3.6-kb NsiI / ClaI, from p0079; Figure 10). These fragments are of
appropriate size for insertion into recombinant adeno-associated virus (AAV) for gene
targeting according to the general approach outlined in International Patent Application
WO 98/48005.

Figure 13 shows the promoterless neo-polyA replacement vector designated plasmid
p0078, directed towards Exon 8. The vector comprises a 1.3-kb left arm, which is
complementary to sequence from Intron 7 up to and including the first nucleotide of Exon 8,
fused in frame to 5 x GGA (glycine) repeats, then to neo-polyA sequence (obtained from
Stratagene). The glycine linker is designed to permit more independent rotation of each
functional domain (the remaining fragment of α1,3GT, and neo), allowing neo to perform its
function. The vector was constructed by amplifying the left arm (using primers 200-005,
5'-CTGGTTGGTT CTAGAACAGG AGGA-3', SEQ. ID NO:45; and 200-007,
5'-CATCTTGTTC AATGGCCGAT CCCATTCCTC CTCCTCCTCC ACTGGTGACA
AAACAGAGTC CATGAG-3', SEQ. ID NO:46, with tail complementary to the start of neo
coding sequence) and neo-polyA sequence (using primers 200-006, 5'-CTCATGGACT
CTGTTTTGTC ACCAGTGGAG GAGGAGGAGG AATGGGATCG GCCATTGAAC
AAGATG-3', SEQ. ID NO:47, with tail complementary to left arm; and 200-008,
5'-CAGGTCGACG GATCCGAACA AAC-3', SEQ. ID NO:48). These fragments were
then used to prime from each other, giving a 2.2-kb fusion product. To complete the targeting
vector, this product was ligated to a ~9-kb right arm, which initiates 0.7-kb into Intron 8,
continuing to the 3'UTR.
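
The glycine linker can be inspected directly in primer 200-006. The sketch below is illustrative only: it locates the five GGA repeats, confirms that the neo start codon follows in frame, and translates the junction with a codon table restricted to the codons that occur in this primer. It does not check the reading frame of the upstream α1,3GT sequence, which is not given in the primer.

```python
# Primer 200-006 (SEQ. ID NO:47), spaces removed.
PRIMER_200_006 = (
    "CTCATGGACTCTGTTTTGTCACCAGT"   # tail complementary to the left arm
    "GGAGGAGGAGGAGGA"              # 5 x GGA linker
    "ATGGGATCGGCCATTGAACAAGATG"    # start of the neo coding sequence
)

LINKER = "GGA" * 5

# Partial codon table: only the codons present at this junction.
CODON_TABLE = {
    "GGA": "Gly", "ATG": "Met", "TCG": "Ser", "GCC": "Ala",
    "ATT": "Ile", "GAA": "Glu", "CAA": "Gln", "GAT": "Asp",
}


def translate(seq: str) -> list:
    """Translate complete codons of `seq`, ignoring any trailing bases."""
    usable = len(seq) - len(seq) % 3
    return [CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3)]


linker_index = PRIMER_200_006.find(LINKER)
neo_index = PRIMER_200_006.find("ATG", linker_index)
in_frame = (neo_index - linker_index) % 3 == 0
junction_peptide = translate(PRIMER_200_006[linker_index:])

if __name__ == "__main__":
    print("linker starts at index", linker_index)
    print("neo ATG in frame with linker:", in_frame)
    print("-".join(junction_peptide))
```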

Figure 14 shows the targeting vector designated plasmid p0047, designed to disrupt
Exon 9 of the α1,3GT gene. The pMC1-neo-polyA replacement vector comprises (from left
to right) 1.8-kb of Intron 8 sequence, located 0.4-kb prior to Exon 9; pMC1-neo-polyA
sequence (Stratagene); and a ~7-kb right arm (HindIII restriction sites blunt ended using T4
DNA polymerase obtained from Promega), initiating 60bp upstream of the stop codon of
Exon 9, and continuing into the 3'UTR.

Figure 15 shows the targeting vector designated plasmid p0046, also directed towards
Exon 9. The pIRES-βgeo-polyA vector (obtained from Austin Smith, Centre for Genome
Research, Edinburgh UK) contains the same right arm as p0047 extended to include the
remainder of Intron 8 and the first ~120bp of Exon 9, which allows expression of the inserted
selectable cassette: the internal ribosome entry site-β-galactosidase-neoR-polyA.

After homologous recombination, the selectable markers in both p0047 and p0046 are
designed to replace >80% of Exon 9, which is hypothesized to comprise at least part of the
catalytic domain of α1,3GT.

Example 4: Disruption of the Galactosyl Transferase gene by homologous recombination

Black Welsh Mountain fetal fibroblasts have previously been used in nuclear transfer
experiments, creating live offspring (GB Patents 2318578 and 2331751). For the purposes of
the present experiment, electroporation conditions were optimized using the β-galactosidase
marker plasmid, pCMV-Sport-βgal (obtained from Gibco). Using a 0.4cm cuvette with 3 x
10^5 cells (0.8ml, 6 µg plasmid DNA) and a setting of 250 µF:400 Volts (Gene Pulser, BioRad),
10-30% of the surviving fibroblasts stained positive for β-gal expression.

For targeting electroporations 10, 25 or 100 µg of NotI linearized p0054 vector was
mixed with 1 x 10^7 early passage Black Welsh Mountain fetal fibroblasts and pulsed as
described above. Cells were grown on tissue culture plastic for 24 hours before G418
(300 µg/ml) was applied. After 10-14 days colonies were isolated. Half the colony cell
population was expanded for analysis, while the remainder was cryopreserved for later
recovery.

Figure 16 shows the results of site specific recombination detected by PCR
amplification. Wild type and targeted α1,3GT alleles were detected using sense (399010,
5'-CAGCTGTGTG GGTATGGGAG GG-3'; SEQ. ID NO:49) and antisense (499006,
5'-CTGAACTGAA TGTTTATCCA GGCCATC-3'; SEQ. ID NO:50) PCR primers,
yielding products of 2.8-kb and 2.2-kb, respectively. A second PCR screen with primers
399010 (SEQ. ID NO:49) and 399005 (5'-AGCCGATTGT CTGTTGTGCC CAGTCAT-3';
SEQ. ID NO:51) produced a fragment of 1.5-kb only in clones that were correctly targeted.
The frequency of site-specific recombination was 1 in 52 (6 in 312) clones in the 10 µg
experiment or 1 in 88 (10 in 877) from all electroporations.
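
The two product sizes are mutually consistent if the targeted allele is taken as the wild type product, less the deleted genomic sequence, plus the inserted cassette. The sketch below makes that arithmetic explicit and shows how band patterns from the two screens might be scored. The cassette size is not stated in the text; it is inferred here as about 0.9 kb from the 1.2-kb fusion product less the 300bp truncated left arm (Example 3). The scoring function, its labels, and the 100bp tolerance are hypothetical.

```python
WILD_TYPE_BP = 2800            # 399010 + 499006 product, wild type allele
DELETED_BP = 1500              # genomic sequence removed by p0054
FUSION_BP = 1200               # left arm-neo-polyA fusion product
TRUNCATED_LEFT_ARM_BP = 300    # left-arm portion of the fusion product
DIAGNOSTIC_BP = 1500           # 399010 + 399005 product, targeted allele only

# Assumption: cassette size = fusion product minus truncated left arm.
CASSETTE_BP = FUSION_BP - TRUNCATED_LEFT_ARM_BP
TARGETED_BP = WILD_TYPE_BP - DELETED_BP + CASSETTE_BP


def has_band(bands_bp, expected_bp, tolerance_bp=100):
    """True if any observed band lies within the tolerance of the expected size."""
    return any(abs(band - expected_bp) <= tolerance_bp for band in bands_bp)


def score_clone(flanking_bands_bp, diagnostic_bands_bp):
    """Hypothetical scoring of one clone from the two PCR screens."""
    wild_type = has_band(flanking_bands_bp, WILD_TYPE_BP)
    targeted = (has_band(flanking_bands_bp, TARGETED_BP)
                and has_band(diagnostic_bands_bp, DIAGNOSTIC_BP))
    if wild_type and targeted:
        return "one allele targeted"
    if targeted:
        return "targeted band only"
    if wild_type:
        return "wild type"
    return "no call"


if __name__ == "__main__":
    print("expected targeted product:", TARGETED_BP, "bp")
    print(score_clone([2800, 2200], [1500]))
    print(score_clone([2800], []))
```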

The neoR gene was tested in one sample by Southern blot analysis. DNA was digested
with the restriction enzyme BsmI, and the blot was probed with neo coding sequence,
producing a diagnostic band at 5.5-kb. However, all cryopreserved samples of targeted Black
Welsh Mountain cells failed to grow after resuscitation. Further analysis other than PCR was
not possible due to inadequate amounts of DNA. The PCR data demonstrate successful gene
disruption by homologous recombination in sheep somatic cells.

Additional vectors were constructed for targeting Exon 8 or 9, using the pac gene as a
resistance marker. Transfection of Black Welsh Mountain fetal fibroblasts with the
pIRES-βgeo-polyA replacement vector failed to produce successful targeting in the 250 clones
examined. Apparently, the efficiency of this vector was lower than that of the promoterless
neo-polyA vector.

Example 5: Gene targeting of the α1,3GT gene in other breeds

In this experiment, targeting vectors were used to affect the α1,3GT gene in cells of a
different breed.

Cell lines were created as follows. Finn Dorset (FD) day 35 fetuses were decapitated
and eviscerated. The carcasses were dissociated manually, and then treated with
trypsin/EDTA solution (Sigma). Primary cultures of fibroblasts were grown in GMEM media
(Sigma), supplemented with 10% FCS (Globe Farm), on tissue culture plastic coated with
0.1% gelatin. Cells were exposed to gentamycin (Gibco) for the first 24 hours to prevent
bacterial contamination. Once confluent, cells were cryopreserved in aliquots of ~1 x 10^6 per
vial. Euploid lines of male genotype were used for gene targeting.

Cell suspensions were targeted with 10 µg of linearized promoterless neo-polyA
replacement vector p0054. Electroporation was conducted by applying an electrical pulse of
125 µF, 350 Volts. A total of 568 clones were derived from one electroporated primary
culture, designated 7G65F4. Colonies were screened for successful targeting as described in
Example 4. Eighteen colonies (3.2%) showed patterns consistent with homogeneous cell
populations in which one α1,3GT allele was deleted. A further 17 colonies containing mixed
populations of targeted and non-targeted cells were detected. Surprisingly, even though the
DNA of the targeting vectors is not isogenic with Finn Dorset gene sequences, the targeting
frequency was higher than in the experiments with Black Welsh Mountain cells (Example 4).
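
The comparison between breeds can be made explicit by putting the clone counts from Examples 4 and 5 on the same scale. The following sketch recomputes the frequencies from the counts reported in the text; the dictionary labels and function name are illustrative.

```python
from fractions import Fraction

# (targeted clones, clones screened), taken from Examples 4 and 5.
EXPERIMENTS = {
    "BW, 10 ug p0054 (Example 4)": (6, 312),
    "BW, all electroporations (Example 4)": (10, 877),
    "FD, culture 7G65F4 (Example 5)": (18, 568),
}


def summarize(targeted: int, screened: int) -> dict:
    """Express a targeting frequency as a percentage and as '1 in N'."""
    frequency = Fraction(targeted, screened)
    return {
        "percent": round(float(frequency) * 100, 1),
        "one_in": round(screened / targeted),
    }


if __name__ == "__main__":
    for label, (targeted, screened) in EXPERIMENTS.items():
        result = summarize(targeted, screened)
        print(f"{label}: {result['percent']}% (1 in {result['one_in']})")
```

On these counts the Finn Dorset frequency of 3.2% compares with 1.9% and 1.1% for the two Black Welsh Mountain figures, which is the difference noted in the text.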

Homogeneous colonies of targeted FD cells stored in liquid N2 were thawed rapidly
and collected by centrifugation. Three clones (3C6, 5E1 and 4H2) from culture 7G65F4
retained a normal karyotype and fibroblast morphology, and grew successfully. Cells were
grown for up to five days in normal medium containing 10% FBS, replacing the medium every
48 hours. The cells were then stressed for up to five days, in medium containing only 0.5%
FBS, replacing the medium every 48 hours. Thereafter, the cells were cultured in 10% FBS and
karyotyped. The 4H2 clone was unstable in longer periods of culture, but clones 3C6 and 5E1
grew in a slow, stable fashion, and were thus suitable as nuclear donors.

Example 6: Animals in which an α1,3GT gene is inactivated

Successfully targeted fibroblast cell lines 3C6 and 5E1 are used as donors for nuclear
transfer. The cells are grown for up to five days in 10% FBS, and then starved for up to five
days in 0.5% FBS, as described in Example 5. Prepared cells are trypsinized for ~5 min, and
collected by low-speed centrifugation (800 rpm for 3 min). They are then resuspended in <
100 µL 10% FCS medium, and used immediately for nuclear transfer.

Six weeks before the transfer, adult female breeding sheep are selected, and brought in
to the facility to acclimatize to the surroundings. For each nuclear transfer experiment, about
8 animals are used as oocyte donors, 2 as temporary recipients, and 10 as final recipients.

Oocytes are collected as follows. Time of estrus is controlled in donor animals by
treatment with an intravaginal progestagen sponge for 11 to 16 days, replacing once if
necessary. Ovine follicular growth is promoted by injection of gonadotropin, with or
without subcutaneous or intramuscular injection of 0.6 mg (2 mL) follicle stimulating hormone
(FSH) twice daily over 4 consecutive days, followed by a single injection of 500 i.u. (2.5 mL)
pregnant mare's serum gonadotropin (PMSG) on the day of sponge removal. Time of
ovulation is controlled by subcutaneous or intramuscular injection of an analogue of
gonadotropin releasing hormone (e.g. Buserelin™, given 24 h after sponge removal). Between
one and twelve days after the onset of estrus, general anesthesia is induced by an intravenous
injection of a barbiturate, followed by intubation and maintenance with gaseous anesthetics.
The reproductive tract is exposed by mid-ventral laparotomy, and the oviduct or uterus is
temporarily cannulated as a passageway for sterile medium. Three small puncture incisions
are made at sites anterior to the udder, through which a laparoscope, manipulating forceps and
a needle are used to manipulate the uterus. After oocytes are collected, the incision is sutured
closed, and the donor animal is allowed to recover from the anesthetic.

Nuclear transfer is conducted as follows. First, the oocytes are stripped of cumulus
cells by triturating with a pipette and incubating with hyaluronidase. They are then
enucleated by removing the first polar body and metaphase plate. A single 3C6 or 5E1
targeted cell is introduced under the zona of each oocyte. The cell combination is subject to
simultaneous electrofusion and activation (0.25 kV cm^-1 AC for 5 sec. to align oocyte and
donor cell, followed by 3 pulses of 1.25 kV cm^-1 DC for 80 µsec to fuse and activate the
reconstructed embryo). The activated cell is maintained in culture overnight at 39°C. Next
day, the cells are embedded in agar chips to protect from macrophages, and then transferred to
the ligated oviduct of a temporary recipient.

For the temporary recipient, estrus is controlled by treatment with intravaginal
progestagen sponge for 11 to 16 days, with or without subcutaneous or intramuscular injection
of 500 i.u. of PMSG. The timing brings the temporary recipients to estrus ~3 days before the
oocyte donors. Cells are collected under general anesthesia using barbiturate followed by
gaseous anesthetics. The reproductive tract is exposed by midventral laparotomy, ligatures of
silk are placed at each uterotubal junction, and embryos are transferred through the fimbriated
end of the oviduct. The laparotomy is then closed, and a long-acting antibiotic is administered.
Blood samples of < 20 mL (< 5% of blood volume) may be taken weekly for monitoring
purposes. The embryos are flushed from the temporary recipient after 6 days, and developing
embryos are removed from the agar chip.

Blastocysts and morulae are then transferred into the recipients that will carry the
embryo to term. Estrus is controlled by treatment with an intravaginal progestagen sponge for
11 to 16 days, bringing the final recipients to estrus simultaneously with the oocyte donor.
The permanent recipients are anesthetized by intravenous barbiturate and gaseous anesthetics,
the reproductive tract is exposed by mid-ventral laparotomy, and the oviduct or uterus is
temporarily cannulated for transfer of the embryos. Alternatively, three small puncture
incisions are made anterior to the udder, and a laparoscope, manipulating forceps and needle
are inserted for manipulation of the uterus. The oviduct or uterus is temporarily cannulated
for transfer of the embryos, and the incision is sutured closed.

Recipients of oocytes with a targeted nucleus, engrafted in the manner outlined, were
monitored for the status of their pregnancy by subcutaneous ultrasonic scanning on a weekly
basis. Ten recipients were determined to be pregnant 35 days after engraftment. At the time
these observations were recorded, insufficient time had passed for any engrafted pregnancies
to reach term. Some of the pregnancies were stable, and some started to regress. A fetus and
umbilical cord were recovered from one of the regressing pregnancies for analysis.

Figure 17 shows the results of PCR analysis of the tissue from the umbilical cord.
Using primers for the 5' untranslated region and the neo gene, a band corresponding to 1.5 kb
was detected, which should only be present in progeny from a cell successfully targeted with a
neo-containing deletion vector. Using primers for the 5' and 3' untranslated regions, two
bands were observed: 2.8 kb, corresponding to the wild type α1,3GT gene, and 2.2 kb,
appropriate for a targeted α1,3GT gene. These results are consistent with inactivation of the
α1,3GT gene on one haplotype.

For animals maintaining their pregnancy, the progress of the fetus is monitored
regularly by ultrasound, and the fetus is brought to term. Blood cells are collected after birth,
to verify that at least one α1,3GT allele has been inactivated.

BIOLOGICAL DEPOSIT

The recombinant phage designated B, C and G have been deposited with the National
Collections of Industrial and Marine Bacteria Limited (NCIMB), 23 St. Machar Drive,
Aberdeen AB2 1RY, Scotland, United Kingdom (Tel: +44 (0)1224 273332; Fax: +44 (0)1224
272461).

A deposit comprising equal concentrations of each phage was received on April 25,
2000, and is catalogued under Accession No. NCIMB 41056. The concentration is 0.8 x 10^8
pfu/mL for each of the three phage, for a total of 2.4 x 10^8 pfu/mL. Each phage can be
separated from the mixture using the following probes:

• Phage B: probe GGGAGGAAGCGAAGGTGCA (SEQ. ID NO:35), 5'UTR

• Phage C: probe CTTGATGGGTTTATCCAGAACA (SEQ. ID NO:28), Exon 5

• Phage G: probe TGATAATCCCAGCAGTATTC (SEQ. ID NO:27), Exon 9

Each recombinant phage was deposited with the NCIMB separately on May 30,
2000. Accession numbers are: Clone B, No. 41059; Clone C, No. 41060; and Clone G, No.
41061. Each deposit has a concentration of 2 x 10^8 pfu/mL.

The phage are useful for obtaining sequence information about the sheep
α1,3-galactosyltransferase gene (described in Example 3) and as an amplification template for
preparing certain polynucleotides of this invention.

The deposits are made under terms of the Budapest Treaty on the International 
Recognition of the Deposit of Microorganisms for the Purposes of Patent Procedure. For 
purposes of prosecution of this and any related application in the United States, the deposit is 
made under terms and with the undertakings of 37 CFR §§ 1.801-1.808. Subject to 
37 CFR § 1.808(b), all restrictions imposed on the availability to the public of the deposited 
material will be irrevocably removed upon the granting of a U.S. patent thereto.

SEQUENCE DATA

TABLE 2: Sequences listed in this Disclosure

SEQ. ID NO:  Designation                          Reference

1            Sheep α1,3-GT cDNA sequence          This Invention.

2            Sheep α1,3-GT amino acid             This Invention.
             sequence

3            Bovine α1,3-GT cDNA sequence         GenBank Accession J04989
                                                  Joziasse et al. "Bovine α1→3-
                                                  galactosyltransferase" J. Biol. Chem. 264, 14290
                                                  (1989)

4            Bovine α1,3-GT amino acid            GenBank Accession P14769
             sequence                             Joziasse et al. (1989), supra.

5            Marmoset α1,3-GT cDNA                GenBank Accession S71333
             sequence                             lymphoid cell line B95.8, mRNA Partial
                                                  Macher et al. "Defining the minimal size of
                                                  catalytically active primate α1,3
                                                  galactosyltransferase" Glycobiology 4, 193 (1994)

6            Marmoset α1,3-GT amino acid          GenBank Accession S71333
             sequence                             Macher et al., supra.

7            Pig α1,3-GT cDNA sequence            GenBank Accession L36152
                                                  Sus scrofa alpha-1,3-galactosyltransferase
                                                  mRNA.
                                                  Strahan et al. "cDNA sequence and chromosome
                                                  localization of pig α1,3 galactosyltransferase"
                                                  Immunogenetics 41, 101 (1995)
                                                  See also GenBank Accession L36535
                                                  Sandrin et al. "Characterization of cDNA clones
                                                  for porcine α(1,3) galactosyl transferase"
                                                  Xenotransplantation (1994)

8            Pig α1,3-GT amino acid sequence      GenBank Accession L36152
                                                  Strahan et al., supra.

9            Mouse α1,3-GT cDNA sequence          GenBank Accession M26925
                                                  Larsen et al. "Isolation of a cDNA encoding a
                                                  murine UDPgalactose:β-D-galactosyl-1,4-N-
                                                  acetyl-D-glucosaminide alpha-1,3-
                                                  galactosyltransferase" Proc. Natl. Acad. Sci. USA
                                                  86, 8227 (1989)
                                                  See also GenBank Accession M85153
                                                  Joziasse et al. "Murine alpha-1,3-
                                                  galactosyltransferase: A single gene locus
                                                  specifies four isoforms of the enzyme by
                                                  alternative splicing" J. Biol. Chem. 267, 5534
                                                  (1992)

10           Mouse α1,3-GT amino acid             GenBank Accession M26925
             sequence                             Larsen et al., supra.

11           Human α1,3-GT pseudogene             GenBank Accession M60263
             DNA sequence                         Joziasse et al. "Characterization of an alpha-1→3-
                                                  galactosyltransferase homologue on human
                                                  chromosome 12 that is organized as a processed
                                                  pseudogene" J. Biol. Chem. 266, 6991 (1991)

12           Human α1,3-GT pseudogene             calculated from
             open reading frame amino acid        GenBank Accession M60263
             sequence                             Joziasse et al., supra.

13           Bovine γ globin gene and globin      GenBank Accession M63452
             pseudogene                           C.H. Duncan (direct submission)

14           Exon 4 of the sheep α1,3-GT          This Invention.
             gene

15           Pre-Exon 4 sequence (Phage B         This Invention.
             and C)                               Region 1 of Figure 7

16           Post-Exon 4 sequence                 This Invention.
             Exon 4 / intron 4 boundary           Region 2 of Figure 7

17           Post-Exon 4 sequence                 This Invention.

18           Post-Exon 4 sequence                 This Invention.

19           Post-Exon 4 sequence                 This Invention.
                                                  Region 3 of Figure 7

20           Pre-Exon 8 sequence                  This Invention.
                                                  Region 4 of Figure 7

21           Intron sequence approx -4kb to       This Invention.
             -2kb from Exon 9                     Region 5 of Figure 7

22           Intron sequence approx -2kb to       This Invention.
             -1kb from Exon 9                     Region 6 of Figure 7

23           Intron sequence 5' to Exon 9         This Invention.
                                                  Region 7 of Figure 7

24           Intron sequence 3' to Exon 9         This Invention.
                                                  Region 8 of Figure 7

25           Intron sequence approx -2kb to       This Invention.
             -1kb from Exon 9                     Region 9 of Figure 7

26 to 51     Probes and PCR primers               This Invention.

Sequence Listing

<110> Clark Mr., John
      Denning, Chris

<120> Animal Tissue For Xenotransplantation

<130> 730/002

<140> [filed herewith]
<141> 2000-06-13

<150> 60/204,148
<151> 2000-05-15

<160> 51

<170> PatentIn Ver. 2.1

<210> 1
<211> 1303
<212> DNA
<213> Ovis sp.

<220>
<221> CDS
<222> (194)..(1303)

<400> 1

agccgaggac gccgccgggg agccgaggct ccggccagcc cccagcgcgc ccagcttctg 60 

cagatcagga gtcagaacgc tgcaccttcg cttcctccca gccctgcctc cttctgcaaa 120 

acggagctca atagaacttg gtacttttgc cttttactct gggaggagag aagcagacga 180 



tgaggagaaa ata atg aat gtc aaa gga aaa gtg att ctg tca atg ctg 229
               Met Asn Val Lys Gly Lys Val Ile Leu Ser Met Leu
               1               5                   10

gtt gtc tca act gtc att gtt gtg ttt tgg gaa tat atc cac agc cca 277
Val Val Ser Thr Val Ile Val Val Phe Trp Glu Tyr Ile His Ser Pro
        15                  20                  25

gaa ggc tct ttg ttc tgg ata aac cca tca aga aac cca gaa gtc agt 325
Glu Gly Ser Leu Phe Trp Ile Asn Pro Ser Arg Asn Pro Glu Val Ser
    30                  35                  40

ggc ggc agc agc att cag aag ggc tgg tgg ttt ccg aga tgg ttt aac 373
Gly Gly Ser Ser Ile Gln Lys Gly Trp Trp Phe Pro Arg Trp Phe Asn
45                  50                  55                  60

aat ggt tac caa gaa gaa gat gaa gac gta gac gaa gaa aag gaa caa 421
Asn Gly Tyr Gln Glu Glu Asp Glu Asp Val Asp Glu Glu Lys Glu Gln
                65                  70                  75

aga aag gaa gac aaa agc aag ctt aag cta tcg gac tgg ttc aac cca 469
Arg Lys Glu Asp Lys Ser Lys Leu Lys Leu Ser Asp Trp Phe Asn Pro
            80                  85                  90

ttt aaa cgc cct gag gtt gtg act atg aca gat tgg aag gca ccc gtg 517
Phe Lys Arg Pro Glu Val Val Thr Met Thr Asp Trp Lys Ala Pro Val
        95                  100                 105

gtg tgg gaa ggc act tac aac aga gcc gtc tta gac gat tac tac gcc 565
Val Trp Glu Gly Thr Tyr Asn Arg Ala Val Leu Asp Asp Tyr Tyr Ala
    110                 115                 120



aag cag aaa att acc gtc ggc ctg acg gtt ttc gcc gtc gga aga tac 613
Lys Gln Lys Ile Thr Val Gly Leu Thr Val Phe Ala Val Gly Arg Tyr
125                 130                 135                 140

att gag cat tac ttg gag gag ttc tta acg tct gct aat aag cac ttc 661
Ile Glu His Tyr Leu Glu Glu Phe Leu Thr Ser Ala Asn Lys His Phe
                145                 150                 155

atg gtt ggc cac cga gtc atc ttt tac gtc atg gtg gac gac gtc tcc 709
Met Val Gly His Arg Val Ile Phe Tyr Val Met Val Asp Asp Val Ser
            160                 165                 170

agg atg cct ttg ata gag ctg ggc cct ctg cgc tcc ttc aaa gtg ttt 757
Arg Met Pro Leu Ile Glu Leu Gly Pro Leu Arg Ser Phe Lys Val Phe
        175                 180                 185

gag gtc aag cct gag agg agg tgg cag gac gtc agc atg gtg cgc atg 805
Glu Val Lys Pro Glu Arg Arg Trp Gln Asp Val Ser Met Val Arg Met
    190                 195                 200

aag acc atc ggg gag cac atc gtg gcc cac atc cag cgt gag gtt gac 853
Lys Thr Ile Gly Glu His Ile Val Ala His Ile Gln Arg Glu Val Asp
205                 210                 215                 220

ttc ctc ttc tgc atg gac gtg gac cag gtc ttc caa gac gag ttc ggg 901
Phe Leu Phe Cys Met Asp Val Asp Gln Val Phe Gln Asp Glu Phe Gly
                225                 230                 235

gtg gag acc ctg ggt gag tcg gtg gcc cag cta cag gcc tgg tgg tac 949
Val Glu Thr Leu Gly Glu Ser Val Ala Gln Leu Gln Ala Trp Trp Tyr
            240                 245                 250

aag gca gat ccc gat gag ttt acc tac gag agg cgc aag gag tct gca 997
Lys Ala Asp Pro Asp Glu Phe Thr Tyr Glu Arg Arg Lys Glu Ser Ala
        255                 260                 265

gca tac att ccc ttc ggc gaa ggg gat ttt tat tac cac gca gcc att 1045
Ala Tyr Ile Pro Phe Gly Glu Gly Asp Phe Tyr Tyr His Ala Ala Ile
    270                 275                 280

ttt ggg gga aca ccc act cag gtc ctt aac atc acc cag gaa tgc ttc 1093
Phe Gly Gly Thr Pro Thr Gln Val Leu Asn Ile Thr Gln Glu Cys Phe
285                 290                 295                 300

aaa gga atc ctc aag gac aag aaa aat gac ata gaa gcc caa tgg cat 1141
Lys Gly Ile Leu Lys Asp Lys Lys Asn Asp Ile Glu Ala Gln Trp His
                305                 310                 315

gat gag agc cat cta aac aag tat ttc ctt ctc aac aaa ccc act aaa 1189
Asp Glu Ser His Leu Asn Lys Tyr Phe Leu Leu Asn Lys Pro Thr Lys
            320                 325                 330

gat att aag ctt gtc aag atg tct tgg cag aca aaa gag tat aat gtg 1285
Asp Ile Lys Leu Val Lys Met Ser Trp Gln Thr Lys Glu Tyr Asn Val
    350                 355                 360

gtt aga aat aac gtc tga 1303
Val Arg Asn Asn Val
365                 370

<210> 2
<211> 369
<212> PRT
<213> Ovis sp.

<400> 2

Met Asn Val Lys Gly Lys Val He Leu Ser Met Leu Val Val Ser Thr 

1 5 10 15 

Val He Val Val Phe Trp Glu Tyr He His Ser Pro Glu Gly Ser Leu 

20 25 30 

Phe Trp He Asn Pro Ser Arg Asn Pro Glu Val Ser Gly Gly Ser Ser 

35 40 45 

He Gin Lys Gly Trp Trp Phe Pro Arg Trp Phe Asn Asn Gly Tyr Gin 

50 55 60 

Glu Glu Asp Glu Asp Val Asp Glu Glu Lys Glu Gin Arg Lys Glu Asp 
65 70 75 80 

Lvs Ser Lys Leu Lys Leu Ser Asp Trp Phe Asn Pro Phe Lys Arg Pro 

85 90 95 

Glu Val Val Thr Met Thr Asp Trp Lys Ala Pro Val Val Trp Glu Gly 

100 105 HO 

Thr Tyr Asn Arg Ala Val Leu Asp Asp Tyr Tyr Ala Lys Gin Lys He 



853 



901 



949 



997 



1045 



1093 



1189 



ate tta tec ccg gaa tac tgc tgg gat tat cat ata ggc eta cct gcg 1237 
He Leu Ser Pro Glu Tyr Cys Trp Asp Tyr His He Gly Leu Pro Ala 
335 340 345 
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115 




Thr 


Val 


Gly 


Leu 




130 






Leu 


Glu 


Glu 


Phe 


145 








Arg 


VCl-L 


Tie 
lie 


Phe 


He 


Glu 


Leu 


Gly 








180 


Glu 


Arg 


Arg 


Trp 






195 




Glu 


His 


He 


Val 




210 






Met 


Asp 


Val 


Asp 


225 










pin 


O *3 J~ 


Val 


Asp 


Glu 


Phe 


Thr 






260 


Phe 


Gly 


Glu 


Gly 






275 




Pro 


Thr 


Gin 


Val 




290 






Lys 


Asp 


Lys 


Lys 


305 








Leu 


Asn 


Lys 


Tyr 


Glu 


Tyr 


Cys 


Trp 








340 


Val 


Lys 


Met 


Ser 






355 




Val 









120 

Thr Val Phe Ala 
135 

Leu Thr Ser Ala 
150 

Tyr Val Met Val 
165 

Pro Leu Arg Ser 

Gin Asp Val Ser 
200 

Ala His He Gin 
215 

Gin Val Phe Gin 
230 

Ala Gin Leu Gin 
245 

Tyr Glu Arg Arg 

Asp Phe Tyr Tyr 
280 

Leu Asn He Thr 
295 

Asn Asp He Glu 
310 

Phe Leu Leu Asn 
325 

Asp Tyr His He 

Trp Gin Thr Lys 
360 



Val Gly Arg Tyr 
140 

Asn Lys His Phe 
155 

Asp Asp Val Ser 
170 

Phe Lys Val Phe 
185 

Met Val Arg Met 

Arg Glu Val Asp 
220 

Asp Glu Phe Gly 
235 

Ala Trp Trp Tyr 
250 

Lys Glu Ser Ala 
265 

His Ala Ala He 

Gin Glu Cys Phe 
300 

Ala Gin Trp His 
315 

Lys Pro Thr Lys 
330 

Gly Leu Pro Ala 
345 

Glu Tyr Asn Val 



125 

He Glu His Tyr 

Met Val Gly His 
160 

Arg Met Pro Leu 
175 

Glu Val Lys Pro 
190 

Lys Thr He Gly 
205 

Phe Leu Phe Cys 

Val Glu Thr Leu 
240 

Lys Ala Asp Pro 
255 

Ala Tyr He Pro 
270 

Phe Gly Gly Thr 
285 

Lys Gly He Leu 

Asp Glu Ser His 
320 

He Leu Ser Pro 
335 

Asp He Lys Leu 
350 

Val Arg Asn Asn 
365 



<210> 3 

<211> 1617 

<212> DNA 

<213> BOVINE 

<220> 
<221> CDS 

<222> (469)..(1575)

<400> 3
ccgggggccg ggccgagctg ggagcgtcga gcccgctgcc cagcgcccgc cggctccctc 60

gcgcccctgc ccgccgcccc ggaggagcgc ccggcggccg gccgacggga gcgcagcggc 120

acaccccgcc ccggcacgcc cgcggggctc gggaggaggc agcgcgccga ctgttccggc 180

agccgaggac gccgccgggg agccgaggcg ccggccagcc cccagcgcgc ccagcttctg 240

cggatcaggg aaaccacgtg tcctcaagtg gccagccagc tgtccccaag aggaacttgc 300

ctggcatttg cacggaaaga cgagacactt cacaaaatca acggagtcag aaggctgcac 360

cttcgcttcc tcccagccct gcctccttct gcagaacgga gctcagtaga acttggtact 420

tttacctttt actctaggag gagagaagca gacgatgagg agaaaata atg aat gtc 477
Met Asn Val
1

aaa gga aaa gtg att ctg tca atg ctg gtt gtc tca act gtc att gtt 525
Lys Gly Lys Val Ile Leu Ser Met Leu Val Val Ser Thr Val Ile Val
5 10 15

gtg ttt tgg gaa tat atc cac agc cca gaa ggc tct ttg ttc tgg ata 573
Val Phe Trp Glu Tyr Ile His Ser Pro Glu Gly Ser Leu Phe Trp Ile
20 25 30 35

aac cca tca aga aac cca gaa gtt ggt ggc agc agc att cag aag ggc 621
Asn Pro Ser Arg Asn Pro Glu Val Gly Gly Ser Ser Ile Gln Lys Gly
40 45 50

tgg tgg ctt ccg aga tgg ttt aac aat ggt tac cat gaa gaa gat gga 669
Trp Trp Leu Pro Arg Trp Phe Asn Asn Gly Tyr His Glu Glu Asp Gly
55 60 65

gac ata aac gaa gaa aag gaa caa aga aac gaa gac gaa agc aag ctt 717
Asp Ile Asn Glu Glu Lys Glu Gln Arg Asn Glu Asp Glu Ser Lys Leu
70 75 80

aag cta tcg gac tgg ttc aac cca ttt aaa cgc ccc gag gtt gtg acc 765
Lys Leu Ser Asp Trp Phe Asn Pro Phe Lys Arg Pro Glu Val Val Thr
85 90 95

atg acg aag tgg aag gct cca gtg gtg tgg gaa ggc act tac aac aga 813
Met Thr Lys Trp Lys Ala Pro Val Val Trp Glu Gly Thr Tyr Asn Arg
100 105 110 115

gcc gtc tta gac aat tat tat gcc aag cag aaa att acc gtc ggc ctg 861
Ala Val Leu Asp Asn Tyr Tyr Ala Lys Gln Lys Ile Thr Val Gly Leu
120 125 130

acg gtt ttc gcc gtc gga aga tac att gag cat tac ttg gag gag ttc 909
Thr Val Phe Ala Val Gly Arg Tyr Ile Glu His Tyr Leu Glu Glu Phe
135 140 145

tta acg tct gct aat aag cac ttc atg gtg ggc cac cca gtc atc ttt 957
Leu Thr Ser Ala Asn Lys His Phe Met Val Gly His Pro Val Ile Phe
150 155 160

tat atc atg gta gat gat gtc tcc agg atg cct ttg ata gag ttg ggt 1005
Tyr Ile Met Val Asp Asp Val Ser Arg Met Pro Leu Ile Glu Leu Gly
165 170 175

cct ctg cgc tcc ttc aaa gtg ttt aag atc aag cct gag aag agg tgg 1053
Pro Leu Arg Ser Phe Lys Val Phe Lys Ile Lys Pro Glu Lys Arg Trp
180 185 190 195

cag gac atc agc atg atg cgc atg aag act atc ggg gag cac att gtg 1101
Gln Asp Ile Ser Met Met Arg Met Lys Thr Ile Gly Glu His Ile Val
200 205 210

gcc cac atc cag cat gag gtt gac ttc ctt ttc tgc atg gat gtg gac 1149
Ala His Ile Gln His Glu Val Asp Phe Leu Phe Cys Met Asp Val Asp
215 220 225

cag gtc ttc caa gac aag ttt ggg gtg gag acc ctg ggc gag tcg gtg 1197
Gln Val Phe Gln Asp Lys Phe Gly Val Glu Thr Leu Gly Glu Ser Val
230 235 240

gcc cag cta caa gcc tgg tgg tac aag gca gat ccc aat gac ttc acc 1245
Ala Gln Leu Gln Ala Trp Trp Tyr Lys Ala Asp Pro Asn Asp Phe Thr
245 250 255

tac gag agg cgg aag gag tct gca gca tac att ccc ttc ggc gaa ggg 1293
Tyr Glu Arg Arg Lys Glu Ser Ala Ala Tyr Ile Pro Phe Gly Glu Gly
260 265 270 275

gat ttt tat tac cat gca gcc att ttt ggg gga aca ccc act cag gtc 1341
Asp Phe Tyr Tyr His Ala Ala Ile Phe Gly Gly Thr Pro Thr Gln Val
280 285 290

ctt aac atc acc cag gaa tgc ttc aaa gga atc ctc aag gac aag aaa 1389
Leu Asn Ile Thr Gln Glu Cys Phe Lys Gly Ile Leu Lys Asp Lys Lys
295 300 305

aat gac ata gaa gcc caa tgg cat gat gaa agc cat cta aac aag tat 1437
Asn Asp Ile Glu Ala Gln Trp His Asp Glu Ser His Leu Asn Lys Tyr
310 315 320

ttc ctt ctc aac aaa cct act aaa atc tta tcc ccg gaa tac tgc tgg 1485
Phe Leu Leu Asn Lys Pro Thr Lys Ile Leu Ser Pro Glu Tyr Cys Trp
325 330 335

gat tat cac ata ggc cta cct gcg gat att aag ctt gtc aag atg tct 1533
Asp Tyr His Ile Gly Leu Pro Ala Asp Ile Lys Leu Val Lys Met Ser
340 345 350 355

tgg cag aca aaa gag tat aat gtg gtt aga aat aat gtc tga 1575
Trp Gln Thr Lys Glu Tyr Asn Val Val Arg Asn Asn Val
360 365

ctttgtgcca gtacatttct gaatttgaga gagtattatt ct 1617

<210> 4
<211> 368 
<212> PRT 
<213> BOVINE 

<400> 4 

Met Asn Val Lys Gly Lys Val Ile Leu Ser Met Leu Val Val Ser Thr
1 5 10 15

Val Ile Val Val Phe Trp Glu Tyr Ile His Ser Pro Glu Gly Ser Leu
20 25 30

Phe Trp Ile Asn Pro Ser Arg Asn Pro Glu Val Gly Gly Ser Ser Ile
35 40 45

Gln Lys Gly Trp Trp Leu Pro Arg Trp Phe Asn Asn Gly Tyr His Glu
50 55 60

Glu Asp Gly Asp Ile Asn Glu Glu Lys Glu Gln Arg Asn Glu Asp Glu
65 70 75 80

Ser Lys Leu Lys Leu Ser Asp Trp Phe Asn Pro Phe Lys Arg Pro Glu
85 90 95

Val Val Thr Met Thr Lys Trp Lys Ala Pro Val Val Trp Glu Gly Thr
100 105 110

Tyr Asn Arg Ala Val Leu Asp Asn Tyr Tyr Ala Lys Gln Lys Ile Thr
115 120 125

Val Gly Leu Thr Val Phe Ala Val Gly Arg Tyr Ile Glu His Tyr Leu
130 135 140

Glu Glu Phe Leu Thr Ser Ala Asn Lys His Phe Met Val Gly His Pro
145 150 155 160

Val Ile Phe Tyr Ile Met Val Asp Asp Val Ser Arg Met Pro Leu Ile
165 170 175

Glu Leu Gly Pro Leu Arg Ser Phe Lys Val Phe Lys Ile Lys Pro Glu
180 185 190

Lys Arg Trp Gln Asp Ile Ser Met Met Arg Met Lys Thr Ile Gly Glu
195 200 205

His Ile Val Ala His Ile Gln His Glu Val Asp Phe Leu Phe Cys Met
210 215 220

Asp Val Asp Gln Val Phe Gln Asp Lys Phe Gly Val Glu Thr Leu Gly
225 230 235 240

Glu Ser Val Ala Gln Leu Gln Ala Trp Trp Tyr Lys Ala Asp Pro Asn
245 250 255

Asp Phe Thr Tyr Glu Arg Arg Lys Glu Ser Ala Ala Tyr Ile Pro Phe
260 265 270

Gly Glu Gly Asp Phe Tyr Tyr His Ala Ala Ile Phe Gly Gly Thr Pro
275 280 285

Thr Gln Val Leu Asn Ile Thr Gln Glu Cys Phe Lys Gly Ile Leu Lys
290 295 300

Asp Lys Lys Asn Asp Ile Glu Ala Gln Trp His Asp Glu Ser His Leu
305 310 315 320

Asn Lys Tyr Phe Leu Leu Asn Lys Pro Thr Lys Ile Leu Ser Pro Glu
325 330 335

Tyr Cys Trp Asp Tyr His Ile Gly Leu Pro Ala Asp Ile Lys Leu Val
340 345 350

Lys Met Ser Trp Gln Thr Lys Glu Tyr Asn Val Val Arg Asn Asn Val
355 360 365



<210> 5 
<211> 1131 
<212> DNA 

<213> Marmoset alpha 1,3-GT

<220> 

<221> CDS 

<222> (1)..(1131) 

<400> 5 

atg aat gtc aaa gga aaa gta att ctg tcg atg ctg gtt gtc tca act 48
Met Asn Val Lys Gly Lys Val Ile Leu Ser Met Leu Val Val Ser Thr
1 5 10 15

gtg att gtt gtg ttt tgg gaa tat atc aac agc cca gaa ggc tct ttc 96
Val Ile Val Val Phe Trp Glu Tyr Ile Asn Ser Pro Glu Gly Ser Phe
20 25 30

ttg tgg ata tat cac tca aag aac cca gaa gtt gat gac agc agt gct 144
Leu Trp Ile Tyr His Ser Lys Asn Pro Glu Val Asp Asp Ser Ser Ala
35 40 45

cag aag gac tgg tgg ttt cct ggc tgg ttt aac aat ggg atc cac aat 192
Gln Lys Asp Trp Trp Phe Pro Gly Trp Phe Asn Asn Gly Ile His Asn
50 55 60

tat caa caa gag gaa gaa gac aca gac aaa gaa aaa gga aga gag gag 240
Tyr Gln Gln Glu Glu Glu Asp Thr Asp Lys Glu Lys Gly Arg Glu Glu
65 70 75 80

gaa caa aaa aag gaa gat gac aca aca gag ctt cgg cta tgg gac tgg 288
Glu Gln Lys Lys Glu Asp Asp Thr Thr Glu Leu Arg Leu Trp Asp Trp
85 90 95

ttt aat cca aag aaa cgc cca gag gtt atg aca gtg acc caa tgg aag 336
Phe Asn Pro Lys Lys Arg Pro Glu Val Met Thr Val Thr Gln Trp Lys
100 105 110

gcg ccg gtt gtg tgg gaa ggc act tac aac aaa gcc atc cta gaa aat 384
Ala Pro Val Val Trp Glu Gly Thr Tyr Asn Lys Ala Ile Leu Glu Asn
115 120 125

tat tat gcc aaa cag aaa att acc gtg ggg ttg acg gtt ttt gct att 432
Tyr Tyr Ala Lys Gln Lys Ile Thr Val Gly Leu Thr Val Phe Ala Ile
130 135 140

gga aga tat att gag cat tac ttg gag gag ttc gta aca tct gct aat 480
Gly Arg Tyr Ile Glu His Tyr Leu Glu Glu Phe Val Thr Ser Ala Asn
145 150 155 160

agg tac ttc atg gtc ggc cac aaa gtc ata ttt tat gtc atg gtg gat 528
Arg Tyr Phe Met Val Gly His Lys Val Ile Phe Tyr Val Met Val Asp
165 170 175

gat gtc tcc aag gcg ccg ttt ata gag ctg ggt cct ctg cgt tcc ttc 576
Asp Val Ser Lys Ala Pro Phe Ile Glu Leu Gly Pro Leu Arg Ser Phe
180 185 190

aaa gtg ttt gag gtc aag cca gag aag agg tgg caa gac atc agc atg 624
Lys Val Phe Glu Val Lys Pro Glu Lys Arg Trp Gln Asp Ile Ser Met
195 200 205

atg cgt atg aag acc atc ggg gag cac atc ttg gcc cac atc caa cac 672
Met Arg Met Lys Thr Ile Gly Glu His Ile Leu Ala His Ile Gln His
210 215 220

gag gtt gac ttc ctc ttc tgc atg gat gtg gac cag gtc ttc caa gac 720
Glu Val Asp Phe Leu Phe Cys Met Asp Val Asp Gln Val Phe Gln Asp
225 230 235 240

cat ttt ggg gta gag acc ctg ggc cag tcg gtg gct cag cta cag gcc 768
His Phe Gly Val Glu Thr Leu Gly Gln Ser Val Ala Gln Leu Gln Ala
245 250 255

tgg tgg tac aag gca gat cct gat gac ttt acc tat gag agg cgg aaa 816
Trp Trp Tyr Lys Ala Asp Pro Asp Asp Phe Thr Tyr Glu Arg Arg Lys
260 265 270

gag tcg gca gca tat att cca ttt ggc cag ggg gat ttt tat tac cat 864
Glu Ser Ala Ala Tyr Ile Pro Phe Gly Gln Gly Asp Phe Tyr Tyr His
275 280 285

gca gcc att ttt gga gga aca ccg att cag gtt ctc aac atc acc cag 912
Ala Ala Ile Phe Gly Gly Thr Pro Ile Gln Val Leu Asn Ile Thr Gln
290 295 300

gag tgc ttt aag gga atc ctc ctg gac aag aaa aat gac ata gaa gcc 960
Glu Cys Phe Lys Gly Ile Leu Leu Asp Lys Lys Asn Asp Ile Glu Ala
305 310 315 320

gag tgg cat gat gaa agc cac cta aac aag tat ttc ctt ctc aac aaa 1008
Glu Trp His Asp Glu Ser His Leu Asn Lys Tyr Phe Leu Leu Asn Lys
325 330 335

ccc tct aaa atc tta tct cca gaa tac tgc tgg gat tat cat ata ggc 1056
Pro Ser Lys Ile Leu Ser Pro Glu Tyr Cys Trp Asp Tyr His Ile Gly
340 345 350

ctg cct tca gat att aaa act gtc aag cta tca tgg caa aca aaa gag 1104
Leu Pro Ser Asp Ile Lys Thr Val Lys Leu Ser Trp Gln Thr Lys Glu
355 360 365

tat aat ttg gtt aga aag aat gtc tga 1131
Tyr Asn Leu Val Arg Lys Asn Val
370 375



<210> 6 
<211> 376 
<212> PRT 

<213> Marmoset alpha 1,3-GT



<400> 6 

Met Asn Val Lys Gly Lys Val Ile Leu Ser Met Leu Val Val Ser Thr
1 5 10 15

Val Ile Val Val Phe Trp Glu Tyr Ile Asn Ser Pro Glu Gly Ser Phe
20 25 30

Leu Trp Ile Tyr His Ser Lys Asn Pro Glu Val Asp Asp Ser Ser Ala
35 40 45

Gln Lys Asp Trp Trp Phe Pro Gly Trp Phe Asn Asn Gly Ile His Asn
50 55 60

Tyr Gln Gln Glu Glu Glu Asp Thr Asp Lys Glu Lys Gly Arg Glu Glu
65 70 75 80

Glu Gln Lys Lys Glu Asp Asp Thr Thr Glu Leu Arg Leu Trp Asp Trp
85 90 95

Phe Asn Pro Lys Lys Arg Pro Glu Val Met Thr Val Thr Gln Trp Lys
100 105 110

Ala Pro Val Val Trp Glu Gly Thr Tyr Asn Lys Ala Ile Leu Glu Asn
115 120 125

Tyr Tyr Ala Lys Gln Lys Ile Thr Val Gly Leu Thr Val Phe Ala Ile
130 135 140

Gly Arg Tyr Ile Glu His Tyr Leu Glu Glu Phe Val Thr Ser Ala Asn
145 150 155 160

Arg Tyr Phe Met Val Gly His Lys Val Ile Phe Tyr Val Met Val Asp
165 170 175

Asp Val Ser Lys Ala Pro Phe Ile Glu Leu Gly Pro Leu Arg Ser Phe
180 185 190

Lys Val Phe Glu Val Lys Pro Glu Lys Arg Trp Gln Asp Ile Ser Met
195 200 205

Met Arg Met Lys Thr Ile Gly Glu His Ile Leu Ala His Ile Gln His
210 215 220

Glu Val Asp Phe Leu Phe Cys Met Asp Val Asp Gln Val Phe Gln Asp
225 230 235 240

His Phe Gly Val Glu Thr Leu Gly Gln Ser Val Ala Gln Leu Gln Ala
245 250 255

Trp Trp Tyr Lys Ala Asp Pro Asp Asp Phe Thr Tyr Glu Arg Arg Lys
260 265 270

Glu Ser Ala Ala Tyr Ile Pro Phe Gly Gln Gly Asp Phe Tyr Tyr His
275 280 285

Ala Ala Ile Phe Gly Gly Thr Pro Ile Gln Val Leu Asn Ile Thr Gln
290 295 300

Glu Cys Phe Lys Gly Ile Leu Leu Asp Lys Lys Asn Asp Ile Glu Ala
305 310 315 320

Glu Trp His Asp Glu Ser His Leu Asn Lys Tyr Phe Leu Leu Asn Lys
325 330 335

Pro Ser Lys Ile Leu Ser Pro Glu Tyr Cys Trp Asp Tyr His Ile Gly
340 345 350

Leu Pro Ser Asp Ile Lys Thr Val Lys Leu Ser Trp Gln Thr Lys Glu
355 360 365

Tyr Asn Leu Val Arg Lys Asn Val
370 375



<210> 7 
<211> 1269 
<212> DNA 
<213> Sus scrofa 

<220> 

<221> CDS 

<222> (16)..(1131)

<400> 7

catgaggaga aaata atg aat gtc aaa gga aga gtg gtt ctg tca atg ctg 51
Met Asn Val Lys Gly Arg Val Val Leu Ser Met Leu
1 5 10

ctt gtc tca act gta atg gtt gtg ttt tgg gaa tac atc aac agc cca 99
Leu Val Ser Thr Val Met Val Val Phe Trp Glu Tyr Ile Asn Ser Pro
15 20 25

gaa ggt tct ttg ttc tgg ata tac cag tca aaa aac cca gaa gtt ggc 147
Glu Gly Ser Leu Phe Trp Ile Tyr Gln Ser Lys Asn Pro Glu Val Gly
30 35 40

agc agt gct cag agg ggc tgg tgg ttt ccg agc tgg ttt aac aat ggg 195
Ser Ser Ala Gln Arg Gly Trp Trp Phe Pro Ser Trp Phe Asn Asn Gly
45 50 55 60

act cac agt tac cac gaa gaa gaa gac gct ata ggc aac gaa aag gaa 243
Thr His Ser Tyr His Glu Glu Glu Asp Ala Ile Gly Asn Glu Lys Glu
65 70 75

caa aga aaa gaa gac aac aga gga gag ctt ccg cta gtg gac tgg ttt 291
Gln Arg Lys Glu Asp Asn Arg Gly Glu Leu Pro Leu Val Asp Trp Phe
80 85 90

aat cct gag aaa cgc cca gag gtc gtg acc ata acc aga tgg aag gct 339
Asn Pro Glu Lys Arg Pro Glu Val Val Thr Ile Thr Arg Trp Lys Ala
95 100 105

cca gtg gta tgg gaa ggc act tac aac aga gcc gtc tta gat aat tat 387
Pro Val Val Trp Glu Gly Thr Tyr Asn Arg Ala Val Leu Asp Asn Tyr
110 115 120

tat gcc aaa cag aaa att acc gtg ggc ttg acg gtt ttt gct gtc gga 435
Tyr Ala Lys Gln Lys Ile Thr Val Gly Leu Thr Val Phe Ala Val Gly
125 130 135 140

aga tac att gag cat tac ttg gag gag ttc tta ata tct gca aat aca 483
Arg Tyr Ile Glu His Tyr Leu Glu Glu Phe Leu Ile Ser Ala Asn Thr
145 150 155

tac ttc atg gtt ggc cac aaa gtc atc ttt tac atc atg gtg gat gat 531
Tyr Phe Met Val Gly His Lys Val Ile Phe Tyr Ile Met Val Asp Asp
160 165 170

atc tcc agg atg cct ttg ata gag ctg ggt cct ctg cgt tcc ttt aaa 579
Ile Ser Arg Met Pro Leu Ile Glu Leu Gly Pro Leu Arg Ser Phe Lys
175 180 185

gtg ttt gag atc aag tcc gag aag agg tgg caa gac atc agc atg atg 627
Val Phe Glu Ile Lys Ser Glu Lys Arg Trp Gln Asp Ile Ser Met Met
190 195 200

cgc atg aag acc atc ggg gag cac atc ctg gcc cac atc cag cac gag 675
Arg Met Lys Thr Ile Gly Glu His Ile Leu Ala His Ile Gln His Glu
205 210 215 220

gtg gac ttc ctc ttc tgc atg gac gtg gat cag gtc ttc caa aac aac 723
Val Asp Phe Leu Phe Cys Met Asp Val Asp Gln Val Phe Gln Asn Asn
225 230 235

ttt ggg gtg gag acc ctg ggc cag tcg gtg gct cag cta cag gcc tgg 771
Phe Gly Val Glu Thr Leu Gly Gln Ser Val Ala Gln Leu Gln Ala Trp
240 245 250

tgg tac aag gca cat cct gac gag ttc acc tac gag agg cgg aag gag 819
Trp Tyr Lys Ala His Pro Asp Glu Phe Thr Tyr Glu Arg Arg Lys Glu
255 260 265

tcc gca gcc tac att ccg ttt ggc cag ggg gat ttt tat tac cac gca 867
Ser Ala Ala Tyr Ile Pro Phe Gly Gln Gly Asp Phe Tyr Tyr His Ala
270 275 280

gcc att ttt ggg gga aca ccc act cag gtt cta aac atc act cag gag 915
Ala Ile Phe Gly Gly Thr Pro Thr Gln Val Leu Asn Ile Thr Gln Glu
285 290 295 300

tgc ttc aag gga atc ctc cag gac aag gaa aat gac ata gaa gcc gag 963
Cys Phe Lys Gly Ile Leu Gln Asp Lys Glu Asn Asp Ile Glu Ala Glu
305 310 315

tgg cat gat gaa agc cat cta aac aag tat ttc ctt ctc aac aaa ccc 1011
Trp His Asp Glu Ser His Leu Asn Lys Tyr Phe Leu Leu Asn Lys Pro
320 325 330

act aaa atc tta tcc cca gaa tac tgc tgg gat tat cat ata ggc atg 1059
Thr Lys Ile Leu Ser Pro Glu Tyr Cys Trp Asp Tyr His Ile Gly Met
335 340 345

tct gtg gat att agg att gtc aag ata gct tgg cag aaa aaa gag tat 1107
Ser Val Asp Ile Arg Ile Val Lys Ile Ala Trp Gln Lys Lys Glu Tyr
350 355 360

aat ttg gtt aga aat aac atc tga ctttaaattg tgccagcagt tttctgaatt 1161
Asn Leu Val Arg Asn Asn Ile
365 370

tgaaagagta ttactctggc tacttctcca gagaagtagc acttaatttt aacttttaaa 1221
aaaatactaa caaaatacca acacagtaag tacatattat tcttcctt 1269



<210> 8 
<211> 371 
<212> PRT 
<213> Sus scrofa 



<400> 8 

Met Asn Val Lys Gly Arg Val Val Leu Ser Met Leu Leu Val Ser Thr
1 5 10 15

Val Met Val Val Phe Trp Glu Tyr Ile Asn Ser Pro Glu Gly Ser Leu
20 25 30

Phe Trp Ile Tyr Gln Ser Lys Asn Pro Glu Val Gly Ser Ser Ala Gln
35 40 45

Arg Gly Trp Trp Phe Pro Ser Trp Phe Asn Asn Gly Thr His Ser Tyr
50 55 60

His Glu Glu Glu Asp Ala Ile Gly Asn Glu Lys Glu Gln Arg Lys Glu
65 70 75 80

Asp Asn Arg Gly Glu Leu Pro Leu Val Asp Trp Phe Asn Pro Glu Lys
85 90 95

Arg Pro Glu Val Val Thr Ile Thr Arg Trp Lys Ala Pro Val Val Trp
100 105 110

Glu Gly Thr Tyr Asn Arg Ala Val Leu Asp Asn Tyr Tyr Ala Lys Gln
115 120 125

Lys Ile Thr Val Gly Leu Thr Val Phe Ala Val Gly Arg Tyr Ile Glu
130 135 140

His Tyr Leu Glu Glu Phe Leu Ile Ser Ala Asn Thr Tyr Phe Met Val
145 150 155 160

Gly His Lys Val Ile Phe Tyr Ile Met Val Asp Asp Ile Ser Arg Met
165 170 175

Pro Leu Ile Glu Leu Gly Pro Leu Arg Ser Phe Lys Val Phe Glu Ile
180 185 190

Lys Ser Glu Lys Arg Trp Gln Asp Ile Ser Met Met Arg Met Lys Thr
195 200 205

Ile Gly Glu His Ile Leu Ala His Ile Gln His Glu Val Asp Phe Leu
210 215 220

Phe Cys Met Asp Val Asp Gln Val Phe Gln Asn Asn Phe Gly Val Glu
225 230 235 240

Thr Leu Gly Gln Ser Val Ala Gln Leu Gln Ala Trp Trp Tyr Lys Ala
245 250 255

His Pro Asp Glu Phe Thr Tyr Glu Arg Arg Lys Glu Ser Ala Ala Tyr
260 265 270

Ile Pro Phe Gly Gln Gly Asp Phe Tyr Tyr His Ala Ala Ile Phe Gly
275 280 285

Gly Thr Pro Thr Gln Val Leu Asn Ile Thr Gln Glu Cys Phe Lys Gly
290 295 300

Ile Leu Gln Asp Lys Glu Asn Asp Ile Glu Ala Glu Trp His Asp Glu
305 310 315 320

Ser His Leu Asn Lys Tyr Phe Leu Leu Asn Lys Pro Thr Lys Ile Leu
325 330 335

Ser Pro Glu Tyr Cys Trp Asp Tyr His Ile Gly Met Ser Val Asp Ile
340 345 350

Arg Ile Val Lys Ile Ala Trp Gln Lys Lys Glu Tyr Asn Leu Val Arg
355 360 365

Asn Asn Ile
370

<210> 9
<211> 1500
<212> DNA 
<213> Musca sp. 

<220> 
<221> CDS 

<222> (277)..(1461)

<400> 9

ccttcccttg tagactcttc ttggaatgag aagtaccgat tctgctgaag acctcgcgct 60

ctcaggctct gggagttgga accctgtacc ttcctttcct ctgctgagcc ctgcctcctt 120

aggcaggcca gagctcgaca gaactcggtt gctttgctgt ttgctttgga gggaacacag 180

ctgacgatga ggctgacttt gaactcaaga gatctgctta ccccagtctc ctggaattaa 240

aagcctgtac tacatttgcc tggacctaag attttc atg atc act atg ctt caa 294
Met Ile Thr Met Leu Gln
1 5

gat ctc cat gtc aac aag atc tcc atg tca aga tcc aag tca gaa aca 342
Asp Leu His Val Asn Lys Ile Ser Met Ser Arg Ser Lys Ser Glu Thr
10 15 20

agt ctt cca tcc tca aga tct gga tca cag gag aaa ata atg aat gtc 390
Ser Leu Pro Ser Ser Arg Ser Gly Ser Gln Glu Lys Ile Met Asn Val
25 30 35

aag gga aaa gta atc ctg ttg atg ctg att gtc tca acc gtg gtt gtc 438
Lys Gly Lys Val Ile Leu Leu Met Leu Ile Val Ser Thr Val Val Val
40 45 50

gtg ttt tgg gaa tat gtc aac aga att cca gag gtt ggt gag aac aga 486
Val Phe Trp Glu Tyr Val Asn Arg Ile Pro Glu Val Gly Glu Asn Arg
55 60 65 70

tgg cag aag gac tgg tgg ttc cca agc tgg ttt aaa aat ggg acc cac 534
Trp Gln Lys Asp Trp Trp Phe Pro Ser Trp Phe Lys Asn Gly Thr His
75 80 85

agt tat caa gaa gac aac gta gaa gga cgg aga gaa aag ggt aga aat 582
Ser Tyr Gln Glu Asp Asn Val Glu Gly Arg Arg Glu Lys Gly Arg Asn
90 95 100

gga gat cgc att gaa gag cct cag cta tgg gac tgg ttc aat cca aag 630
Gly Asp Arg Ile Glu Glu Pro Gln Leu Trp Asp Trp Phe Asn Pro Lys
105 110 115

aac cgc ccg gat gtt ttg aca gtg acc ccg tgg aag gcg ccg att gtg 678
Asn Arg Pro Asp Val Leu Thr Val Thr Pro Trp Lys Ala Pro Ile Val
120 125 130

tgg gaa ggc act tat gac aca gct ctg ctg gaa aag tac tac gcc aca 726
Trp Glu Gly Thr Tyr Asp Thr Ala Leu Leu Glu Lys Tyr Tyr Ala Thr
135 140 145 150

cag aaa ctc act gtg ggg ctg aca gtg ttt gct gtg gga aag tac att 774
Gln Lys Leu Thr Val Gly Leu Thr Val Phe Ala Val Gly Lys Tyr Ile
155 160 165

gag cat tac tta gaa gac ttt ctg gag tct gct gac atg tac ttc atg 822
Glu His Tyr Leu Glu Asp Phe Leu Glu Ser Ala Asp Met Tyr Phe Met
170 175 180

gtt ggc cat cgg gtc ata ttt tac gtc atg ata gac gac acc tcc cgg 870
Val Gly His Arg Val Ile Phe Tyr Val Met Ile Asp Asp Thr Ser Arg
185 190 195

atg cct gtc gtg cac ctg aac cct cta cat tcc tta caa gtc ttt gag 918
Met Pro Val Val His Leu Asn Pro Leu His Ser Leu Gln Val Phe Glu
200 205 210

atc agg tct gag aag agg tgg cag gat atc agc atg atg cgc atg aag 966
Ile Arg Ser Glu Lys Arg Trp Gln Asp Ile Ser Met Met Arg Met Lys
215 220 225 230

acc att ggg gag cac atc ctg gcc cac atc cag cac gag gtc gac ttc 1014
Thr Ile Gly Glu His Ile Leu Ala His Ile Gln His Glu Val Asp Phe
235 240 245

ctc ttc tgc atg gac gtg gat caa gtc ttt caa gac aac ttc ggg gtg 1062
Leu Phe Cys Met Asp Val Asp Gln Val Phe Gln Asp Asn Phe Gly Val
250 255 260

gaa act ctg ggc cag ctg gta gca cag ctc cag gcc tgg tgg tac aag 1110
Glu Thr Leu Gly Gln Leu Val Ala Gln Leu Gln Ala Trp Trp Tyr Lys
265 270 275

gcc agt ccc gag aag ttc acc tat gag agg cgg gaa ctg tcg gcc gcg 1158
Ala Ser Pro Glu Lys Phe Thr Tyr Glu Arg Arg Glu Leu Ser Ala Ala
280 285 290

tac att cca ttc gga gag ggg gat ttt tac tac cac gcg gcc att ttt 1206
Tyr Ile Pro Phe Gly Glu Gly Asp Phe Tyr Tyr His Ala Ala Ile Phe
295 300 305 310

gga gga acg cct act cac att ctc aac ctc acc agg gag tgc ttt aag 1254
Gly Gly Thr Pro Thr His Ile Leu Asn Leu Thr Arg Glu Cys Phe Lys
315 320 325

ggg atc ctc cag gac aag aaa cat gac ata gaa gcc cag tgg cat gat 1302
Gly Ile Leu Gln Asp Lys Lys His Asp Ile Glu Ala Gln Trp His Asp
330 335 340

gag agc cac ctc aac aaa tac ttc ctt ttc aac aaa ccc act aaa atc 1350
Glu Ser His Leu Asn Lys Tyr Phe Leu Phe Asn Lys Pro Thr Lys Ile
345 350 355

cta tct cca gag tat tgc tgg gac tat cag ata ggc ctg cct tca gat 1398
Leu Ser Pro Glu Tyr Cys Trp Asp Tyr Gln Ile Gly Leu Pro Ser Asp
360 365 370

att aaa agt gtc aag gta gct tgg cag aca aaa gag tat aat ttg gtt 1446
Ile Lys Ser Val Lys Val Ala Trp Gln Thr Lys Glu Tyr Asn Leu Val
375 380 385 390

aga aat aat gtc tga cttcaaattg tgatggaaac ttgacactat ttctaacca 1500
Arg Asn Asn Val
395



<210> 10 
<211> 394 
<212> PRT 
<213> Musca sp. 

<400> 10 
Met Ile Thr Met Leu Gln Asp Leu His Val Asn Lys Ile Ser Met Ser
1 5 10 15

Arg Ser Lys Ser Glu Thr Ser Leu Pro Ser Ser Arg Ser Gly Ser Gln
20 25 30

Glu Lys Ile Met Asn Val Lys Gly Lys Val Ile Leu Leu Met Leu Ile
35 40 45

Val Ser Thr Val Val Val Val Phe Trp Glu Tyr Val Asn Arg Ile Pro
50 55 60

Glu Val Gly Glu Asn Arg Trp Gln Lys Asp Trp Trp Phe Pro Ser Trp
65 70 75 80

Phe Lys Asn Gly Thr His Ser Tyr Gln Glu Asp Asn Val Glu Gly Arg
85 90 95

Arg Glu Lys Gly Arg Asn Gly Asp Arg Ile Glu Glu Pro Gln Leu Trp
100 105 110

Asp Trp Phe Asn Pro Lys Asn Arg Pro Asp Val Leu Thr Val Thr Pro
115 120 125

Trp Lys Ala Pro Ile Val Trp Glu Gly Thr Tyr Asp Thr Ala Leu Leu
130 135 140

Glu Lys Tyr Tyr Ala Thr Gln Lys Leu Thr Val Gly Leu Thr Val Phe
145 150 155 160

Ala Val Gly Lys Tyr Ile Glu His Tyr Leu Glu Asp Phe Leu Glu Ser
165 170 175

Ala Asp Met Tyr Phe Met Val Gly His Arg Val Ile Phe Tyr Val Met
180 185 190

Ile Asp Asp Thr Ser Arg Met Pro Val Val His Leu Asn Pro Leu His
195 200 205

Ser Leu Gln Val Phe Glu Ile Arg Ser Glu Lys Arg Trp Gln Asp Ile
210 215 220

Ser Met Met Arg Met Lys Thr Ile Gly Glu His Ile Leu Ala His Ile
225 230 235 240

Gln His Glu Val Asp Phe Leu Phe Cys Met Asp Val Asp Gln Val Phe
245 250 255

Gln Asp Asn Phe Gly Val Glu Thr Leu Gly Gln Leu Val Ala Gln Leu
260 265 270

Gln Ala Trp Trp Tyr Lys Ala Ser Pro Glu Lys Phe Thr Tyr Glu Arg
275 280 285

Arg Glu Leu Ser Ala Ala Tyr Ile Pro Phe Gly Glu Gly Asp Phe Tyr
290 295 300

Tyr His Ala Ala Ile Phe Gly Gly Thr Pro Thr His Ile Leu Asn Leu
305 310 315 320

Thr Arg Glu Cys Phe Lys Gly Ile Leu Gln Asp Lys Lys His Asp Ile
325 330 335

Glu Ala Gln Trp His Asp Glu Ser His Leu Asn Lys Tyr Phe Leu Phe
340 345 350

Asn Lys Pro Thr Lys Ile Leu Ser Pro Glu Tyr Cys Trp Asp Tyr Gln
355 360 365

Ile Gly Leu Pro Ser Asp Ile Lys Ser Val Lys Val Ala Trp Gln Thr
370 375 380

Lys Glu Tyr Asn Leu Val Arg Asn Asn Val
385 390

<210> 11

<211> 1885 

<212> DNA 

<213> Homo sapiens 

<400> 11 

tgcctcagcc tcccaaagtg ctggactaca ggcatgagcc aaccacgccc gcccagccaa 60
atatggtgaa accctttctc tactaaaaat acaaaaatta gctgggcaga tgataaggag 120
aaaataatga atgtcaaagg aaacgtaatt cctccagtgc tggttgtctc aactgcgatt 180
gttgtggttt ggaaatgtat caacacccaa gaaggctctc tcttgtggat gtatcactta 240
aaacacccag aagttgttga cggcggcggg cagaagggcc ggtggtttcc aagctgcttc 300
aacaatggga cccacaatcc tcaagcagag gaagaagaag acatagacaa agaaaaagag 360
aaagaaaaaa gaaaagaaga tgacaaaaaa agagcttcgg ctatggaact ggtttaatcc 420
atacgatcgc ccagaggtgc tgacggtgac cagttggaat gagccggttg tgtgggaagg 480
cacttagaac agaaccatcc tagaaaatta ttatgccaaa cagaaaatta ctgtgggttg 540
acggtttttg ctgtcggaag acatgtagag tattacttga aggagttcat agcatctgct 600
gataggtact tcatggtaag ccacaaagtc atgttttaca tcatggccga taatccagga 660
tgctgttgat agagctgggt cctctgcgtt tcttcagagt gtttgagatc aagcccgaga 720
agaggtagca ggtcatcagc atcgtgggca tgaagaccaa cagggagcac atgttggccc 780
acatccaaca ggaagttgac ttcctcttct gcatggatgt ggaccaagtg ttccaagaca 840
attttggagt ggagaccctg ggccagtccg tggctcagct ataggcctgg tggtactgga 900
cacatcctga ggagaatact taggagaggc agaaggagtc ggtggcatac attccatttg 960
gccaggggga tttttattac cacgcagcca tttttggtgg aacacccatt cactttctca 1020
acatcaccca ggagtgcttt aagggaatcc tccaggacag gaaaaatgac ctcgaagcca 1080
agtggcatga taaagccacc taaacaagta tttcttcctc aacataccct ctaagatctt 1140
atcaccagaa tgctgctggg attatcatta taggtcatca tgaagatatt aaaattgtca 1200
agctatcttg gcagcaaaaa gaatatacgt tggttagaaa tgatgactga cttcaaattg 1260
tgccaacagt tttttgaatt tgagagaatt attattctgg ctacttcctc agaaaacaat 1320
tttaacttta agaaaatatt aatgaaacac caacattgca atcacatact atctctcctt 1380
gtaactttga gcattgtaat atgagagaat gaatctatgg taatcaggtg taaattccca 1440
gtgatttctt atgtattctg aatttggggg aaatactatc agctgaacca gaaagaactt 1500
gtcataggca aagataaagc cagaaacagg ccaggcacgg cggcttatgc ctgtaatccc 1560
agtactttgg gaggccaagg gagtggatca cctgaggtca ggagttcgag accagcttgg 1620
ccaacatggt gaaatcccgt ctctactaaa aatacaaaaa attagccggg catggtggcg 1680
cacacctgta atcccaccta ctcaggaggc tgaggcggga gaatcacttg aacctgggag 1740
gcagaggttg cattgagcca gattgtacca ttgtactcca gcctgggcaa aagagtgaaa 1800
ttccatctaa aaacaaacaa acaaaactct acatgtgtca gataacatcc tggagaaaat 1860
caggtggtaa gggaagtgtt ggcag 1885



<210> 12 

<211> 197 

<212> PRT 

<213> Homo sapiens 

<400> 12 

Ser Arg Met Leu Leu Ile Glu Leu Gly Pro Leu Arg Phe Phe Arg Val
1 5 10 15

Phe Glu Ile Lys Pro Glu Lys Arg Xaa Gln Val Ile Ser Ile Val Gly
20 25 30

Met Lys Thr Asn Arg Glu His Met Leu Ala His Ile Gln Gln Glu Val
35 40 45

Asp Phe Leu Phe Cys Met Asp Val Asp Gln Val Phe Gln Asp Asn Phe
50 55 60

Gly Val Glu Thr Leu Gly Gln Ser Val Ala Gln Leu Xaa Ala Trp Trp
65 70 75 80

Tyr Trp Thr His Pro Glu Glu Asn Thr Xaa Glu Arg Gln Lys Glu Ser
85 90 95

Val Ala Tyr Ile Pro Phe Gly Gln Gly Asp Phe Tyr Tyr His Ala Ala
100 105 110

Ile Phe Gly Gly Thr Pro Ile His Phe Leu Asn Ile Thr Gln Glu Cys
115 120 125

Phe Lys Gly Ile Leu Gln Asp Arg Lys Asn Asp Leu Glu Ala Lys Trp
130 135 140

His Asp Lys Ala Thr Xaa Thr Ser Ile Ser Ser Ser Thr Tyr Pro Leu
145 150 155 160

Arg Ser Tyr His Gln Asn Ala Ala Gly Ile Ile Ile Ile Gly His His
165 170 175

Glu Asp Ile Lys Ile Val Lys Leu Ser Trp Gln Gln Lys Glu Tyr Thr
180 185 190

Leu Val Arg Asn Asp
195



<210> 13 

<211> 17141 

<212> DNA 

<213> BOVINE 

<400> 13 

ggagagtaga gtttctgagt ttagacacac tgaatcagcc aatcacagat gaagagcact 60
gagcaacaag agttcatctt acattccccc aaaccaatga acttgtatta tgccctgggc 120
taatctgctc tcagaagcag ggagggcagg aggctgggtg gggctcacaa ggaagaccag 180
ggcccctact gcttacacat gcttttgaca caacttgcag ctgcacaaac acacatcatg 240
gtgtatctga ctcttgagaa gaaggctact gtcattgact tgtggagtaa gatgagggtg 300
gctgaagttg gtccggatac tgtaggcagg caggtattca acttacaagg caggctgaag 360
gagagtgaat gtcagttggg tgtgtgggga cagagccatt gcctgagatt ctggcaggca 420
ctgactccct ctgaccttgt gctgttttca ccccctttgc tgctggtcgt ctacccctcg 480
actcagaggt tctttgacta ttgtggggac tgtcctttgc tgatttatgg gcaatgtttt 540
accttctttg tttccaggca tagtttcctc ttattcattc ttgtttttct ctgtttgttc 600
tctgcagtat cttctttata tttaaacatt ttgactgttt aagtgtttga gtattaagac 660
tttcttcttt tatgtcactt aaaaattttg tctcatgatt ttcccttatc tcttcttttt 720
taagcaagga agacaaaatg atgtattgct tcttgaaaca gttcaaaaaa ataaaaaaaa 780
tgatagcaag ttcagaatta agatagaaag agagaaacat ctctaagtat aaactcaggc 840
ttatatgggt ggcttcacat cagtagtaac atctacactt tagccatctt tctgcttata 900
ttctaggggc acagcttgag atgagactga aatactaagt ccaaattggg tgcctctgct 960
aacattgtcc ttgtttttca tctctaccac acagctcctg ggcaatatac tgatgagtac 1020
actggcttga aactttggca aggaattcac cccagaattt caggttgcct gtcagaaggt 1080
ggtggctggt gtggttaatg ctctcaccta caaataccat tgagatcctg tcctattttt 1140
tattttcaaa tgagtattta tttctaattg attgatgatt ggtttacaat attggtttga 1200
ttactatcat acattaatat gaattaacca taggtgtatg tatttcccct cccagctgaa 1260
tctccctctc acctctggcc cattcccacc cctctaggtt attacagagt tcccattcga 1320
attcgaattc cttgagttga tgaccattgt gatagcttat gttgaatggt gtcagatttt 1380
tggcttccag agcagacagt ttcgatccgg ggccagagac aaaatttgat cactcagagc 1440
ttttgcgtgg cagttttatt acagtgaaaa ggggacagag aaagcttctg acatagacat 1500
cagaagggga acaggagact gtcccactag ctagtctcat caagacctta tataattttt 1560
cagttggtta ctaacaatgg aaaggtctta tcaaaccctc tccctaaatc ctaaaataac 1620
aggattagtc agaaggttct taagaaggag aaccatgtcc tcaagcaaca tccattgttt 1680
ttctataatc attagtggac ttccctgatg gctcagaagg taaagcctct gtctacaatg 1740
cgagagacct gggttcgatc cctgggttgg gaagattccc tgagaaggaa atagcaaccc 1800
actccggtac tcttgccttg aaaatccctt gaaaatcccg tggatggagg agcttggtgc 1860
aggctactgt ccatggggtt gcaaagagtc gggaatgact gcgcgacttc actttcactt 1920
tcactttctt tcaatcatta gtataaagtt taaattcatc ttgagcaaga tgaattgttt 1980
tgttgtttaa tcatttagtt aagttcagtt cagtcactca gtcatgtccg actctttgcg 2040
accccatgaa tcacagcacg ccaggcctcc ctgtccatct cctcccggag ttcacccaac 2100
gtcacgtcca tcgagtcggt gatgccatcc agccatctca tcctctgtcg tccccttttc 2160
ctcctgcccc caatccctcc cagcatcagg gtcttttcca gtgagtcaac tcttcacatg 2220
aggtggccaa agtactggag tttcagtttt agcatcattc cttccaaaca acaccccagg 2280
gcttatctcc tttagaatgg actggttgga tctccctgca gtccaagggg ctcgcaagaa 2340
tcttctccaa caccacagtt caaaagcatc aattcttcgg cactcagctt tcttcacagt 2400 
ccaactctca catccgtaca tgaccactgg aaaaaccata gccttgacta gacggacctt 2460 
tgttggaacg taatgtctct gcttttcaat atgctatcta ggttggtcat agctttcctt 2520 
ccaaggagta agcatcattt aatcattagc tctgggctta aagaaagtta atcttcagac 2580 
tgttgtcaaa acaactcaaa gtttaaggaa aaatttctta tgtgactaag acaatgtaat 2640 
gcaaaaaaaa atagtttgtc cttttctcct ccttgacaac tccagacccc ttgatggccc 2700 
tggactcctt atcaatctac ctaagagtta actctctcaa ctggaagatc ctatttccct 2760 
acattcttct attgtttaac aattcaaaaa atttaaagtc atttttgagt tacaatattg 2820 
ttcctgttgt ttttatgttc ttgtttttta ggccacaaga catgtcagat cttagctccc 2880 
tgaacagaat agaactctca ccccctgcat tagaccaagc ttaaacacta gactgtcatg 2940 
gaagtcccag tatttcccca gattttatct tctgaacttg gcgaaataat gtccaccctc 3000 
aatgatatga cttcttccta ataaagaact ttcagctcaa tttcctgatt cattttactc 3060 
attttttttt tttttctttc cgctaaggat atgggaaagt ccctgaggtc tacagataga 3120 
aaggctttgt gtcttataca gagagaacaa gggaaatgag aaaagtaagg ggaccacaca 3180 
gtcaataacg gatgacattc ctgtctcaaa gcataagaac tataagaagg tttaaaatat 3240 
agtgaggata ctagcattac aggggctcat gggaaacttc aaatttagac gcaaacttca 3300 
gggtaacact ctgatcatat ccactaagta tcaagaaata atgtgctcag agacaaaatg 3360 
attgctcgag aaactgttat taatgtttcc ttttttccta cttcccttgc ctgcacattt 3420 
taggcaacaa tgctatgtta aaccaggtgt tcctattcta atgtataatc tccctctccc 3480 
atttgcccca aaacacctcc aaaattaacc ccaacctccg atttcaggta aagctcagtg 3540 
ctcacataac acttgcttga atcttttgac cttggcatca gtgaaatttc cagtattgcc 3600 
taaggaaatt cttccaaact ctcccactcc aaacttttca catttcttcc tgtctagcat 3660 
acttcctgtc tagcacaccc tgtgctgtgt ataagtgttt ctgccttttg tactatccta 3720 
cataattacc atgtcttaca tagcagatcc tccatagaca gtaagtgtat gtttactaag 3780 
tgtagtaatc aatgagtgaa caattatttc ttgtcccaaa actcctggga gtagaggaca 3840 
atctgaaaaa tatgctattg catagtgttt ctttggtgtt agtgctacaa tatacaatga 3900 
ggaagcacac aggaaagtaa ttcactgatt agatatttgc tttctgatta tagtagactt 3960 
aaaaattttt gaactctatt taatatggag tatatccaat taaaacagtt gtgatagttc 4020 
aaggctcagc aagtgctaac cattaatcat gttccattct ttcccaaact ccctcgcatc 4080 
caggctggca catatcattg attgacttcc ctgtgctatt cagtaagacc ttgttggtta 4140 
tccattttaa agatagcagt gtgtacttgt ccatcccaaa cttcctaact aactatccct 4200 
tgtgaaatta tttttgtcat atgtaaaatt ggaaggacgt caggaacctt ctaacttgca 4260 
tgcatttctt taaatctata tcatgctcca gttgattaaa tttagacttg actacctgaa 4320 
acagatcatg ttgcttcctt ctataaaatg tgcattcaga ataattttat cttactgcca 4380 
ggttattggc agtgcataaa gtttactgtt tcaccactgt attttgaact gtctgaaggc 4440 
aaatgagctt ttaaatattt atttgtttct tctcatgttt ttatacatag taggcaatga 4500 
gaatccttta caccccaaca ctatgtttaa taaccacagg gtcacatccc agaatcagag 4560 
acatctcagg aaaatcttag caagtttcct ttcatacacc tgagcacaga aaagtcatca 4620 
taagaattag acattttcaa aagattttaa agagggtaag agagactgtg tgtttcacaa 4680 
tttggaatga gaaagcaaag tacccttctt ccttctaaat gttcataaaa aaatgaaaga 4740 
atatctgagg aacttactgt actgtctttt gttccttcac ctttggaact ggccaatgaa 4800 
tctagataaa aagttgggac tgaaaattcc aatgctggag aattcaagag aacatcactg 4860 
gggtgggatt tggatcaagt tcaaacagtt gctgtattgt tggtgttggt ttctctgtta 4920 
ggagtgaggg agtcaaagga tggaaagaga cttggcagct tgttatatgt tttgattctt 4980 
tgttaaagcg gagcttctgt aaaagtctaa aagtaaaatt aattccttaa tatttcattt 5040 
tatttgataa tttacccaat agttacttct agttccatcc tatcatgttg cttcatagat 5100 
catgtttagt atgtgttaat gcaaatggta tatatatata tagaagaaag aaagatggaa 5160 
agagaccaga ggcaaggaga gaaggacaac gacagggagg cagacacaga gagtcaaaga 5220 
gagagaaagg gggagggggg agggggaagg agaggagaga gagagagaag taagtaatgt 5280 
tgtgcattga tttccataat cattttaaaa tcagccagtt gaatgtcatg ggaatttaga 5340 
atgctattct agattttcca tctcttaagc ctcagagcag tgctctgaaa gttggctttg 5400 
ggagttaatg ccttatatat ttgtcaaggg ttcccagtct gaatccatga ttctgccttt 5460 
ccagacatac agcttcaaat ttcacttgag aaagactgcc cctttgttct tgataaatca 5520 
atatcatagc ccacttttga ggtaaataaa ggctaattct gaatgaatta aaacctgagg 5580 
ccagtggagg aagtattaat atacaattga taaactgaga cattaactgt cctgtaataa 5640 
ttaggatata ttttattaaa gttattacta tattgacttg actgttttta gaaaaaaaaa 5700 
tatgaccttt taaaacatga gttaaatcgc accatttcta aactattctg tttatgacct 5760 
ctgtttctat gagttctgct cttcaagaag aatttacttc atttcatttt ttgttttatt 5820 
ttattctgct ttatttttga aacaatgcta aaactgaggt aactaagata ttgtacatct 5880 
ggggaagaag attaagtgat aaccacttta tcttgtaata aactttaatg gaaaataatc 5940 
aaataaatag ctactatacc cagaaaaata cagatgagtt tggtgaaata ttatacttgg 6000 
gtaattaaaa attgaacatc aaagcatgta caaaagaatt ctagctgagt gaaatttttg 6060 
aaccaaatga tgaaaccaca cagctctgga tgaatctcat gattctcttg aagaaagtca 6120 
atattcaccc acttaataat actcagatta ggcccatcct ggtatagaac attaggaatt 6180 
aggtcacctt gccagactgc aataaagggg aaaaaaaaac aaaacaaaac tgtctatttc 6240 
tctctgtgtg gacttgtatg agtgtgtgct tgtgtgtgtg tgtgttcagt gcccagtggg 6300 
tgcgaatcaa agtagagcag acatgctgtg gatcccttca cagagtacaa gaaagctcat 6360 
gatggtgaag atggtagtgg gcctaagaag cattccaaaa cttttttagt ggtgacctgg 6420 
atttgcactt taggctctga acataaccct acttatataa ctattcacgc tttctttttt 6480 
gtgcctgaac ttttgattgt cttataacaa cttgcagcct tatccctgca gagttatggt 6540 
aatagaaaga aaaacctgtt tcactcttta atcataccaa gtagactgaa gcgacttagg 6600 
cagtagcagc agcagcaatg gtgaaagaca acaactcttg ccctgagact tgcatgatgg 6660 
tggctgcagc aggtcagtac aaaagagggg ctcatagacc attcatggaa caagaaagac 6720 
ataaaaaaga agacacagta gaattatttg taaatatatg tatgtgtgtg tgtgtatgtg 6780 
tatacacacc aatatatact gtaatactac tcaggtataa aaaagaatga catttttcca 6840 
gttgcaacaa cttggatgaa cctgtgggat attatgcttg gtgaaatgtc agacaaagaa 6900 
agacaaaatc ttgtttattg tgacttatat gtggaacata aaaagcaaaa caaacaaatt 6960 
aatataacaa agcagaaaca aactcacaga tatagagaac aaactactgg ttaccatggg 7020 
gagagagggg aggggagggc caggataggg aaaggaaatt aaaagataca aattactgtg 7080 
tacaaaatag attagcaaca aggatatatt gtacagaaca gaaaacatag tcaataattt 7140 
attgcaactt taaatggtgc atagtcctta aaaatactga atcactatgt tgtatatctg 7200 
aaactaatat aatattatag atcaaatatt ttaaaataat taattcgaat caaatgaatt 7260 
agggaaatca tttctgaaaa cagaaaagga taggggaagt ctttaagtac atgctaattt 7320 
ctgtcagtca agagggcaaa attgtagccc tggcaatttg gctcctttac aagaaaatat 7380 
agaaatagac tatactatta gttagtaagt tctgtgagtt ttccataaca cagagttcct 7440 
gaaaagatat gtccaaagtt ttccagaaga agtctttaac tcacatatgc ctgacctctg 7500 
atatcagaga tataaaatag gtcttttaaa gagtacaaat ataaaacaat gataaagcta 7560 
atggagattt cagtttaatc gattgaaaat caaaactcaa gatcaaatca aatgtagctt 7620 
tttgttttta acatttaaaa tatttaaaga tttcttttcc cagaagtcaa catgtggaaa 7680 
tttttttcta agtatagacc aatagttaga gcttctttct ttgaaggttg cagaaaagag 7740 
tgaatataaa ttgaaaatgt ttaaactcta tattttttcc cctataatct tgtgggcata 7800 
taatcctacc ctaacttcca gagagccaag caagagccaa cacttttcta catttctgaa 7860 
aatatcagga tgaatttgtt cacaaattac ttgaatggga atgggccaga tcaaaggggt 7920 
aaggaatctt catttaccta gcaatgatgg aaaaaaaata ttttggaggc tggagtaatt 7980 
aaaactggac atttatgaat ttctttgtaa caactagaaa tgcagaagtc ctttcattct 8040 
agttccccta aaaatatatt agttggagtt acagagacat agagctgaaa gaatgagaga 8100 
tttttaaaaa tacgttttgc cgggtgatgg gcaaaaggtg agagaaaggt gattctagac 8160 
atcaagttgt ctgaatcagg agaaggttct aaggtgagag aagggaagta aatatatatg 8220 
ctcctctcct tactaaaagg aagtggaaat tcaagaagaa gggaagatgc agaaagtaga 8280 
gatcatagaa agtaggggga cctcagggaa ctctgagatc taatgctgag ctcagaaatg 8340 
ctttctgctt ttgctttttc attaaatctt ctagtagtaa gaagcagacc tctacacttc 8400 
agaagttctt cttccctaag aaggttaaga ggatttatta ccacgggttg gaggaggtgc 8460 
aaatagagaa gaaatatttg aaaagctgta gataaagaag gatacatttt agtaaaagag 8520 
gtataaacaa agtatataga gaaaacagaa ttaaagatga actgaaatag aagaaatttt 8580 
tatgcacact gtactcttag aggtttaggg gagaaaacca aataatctga ttcaaactgg 8640 
aatgcctttc cacaatttta tccctccttt taaatcacag agaagtttct ctgttcttga 8700 
agacactttc ccaggtcagt ttatgaggaa acaatttata tgcctcgatt tagtctttga 8760 
acacattgcc tatctcaggg ttattgaaac tttgggagtt tgattgattt tttactttgt 8820 
atttcttact acatggaggg gtctctatgt tttcatctat gggttaatgg agtagctatt 8880 
tcataggaat tcttgatcca tacttattct atttttctat cattatttat ttggtcataa 8940 
attaaattaa gaaaaaatga atgagtagat tagcaaatga atatttgttt tcataccaga 9000 
acaacttaat cccaaacaag caaacaaaag agatgcatat ttagaacagg gacagaggtt 9060 
ttatccatgc tgtccttgta attattttgt gtattctgaa ggcacaggag atgatccatc 9120 
cacatcgtct taagctgaat catagtggac aaatcctttc cactttctgg agcccggatt 9180 
cttcatttgt ataataagaa aattgaggag gtggtttcca agagattact tcgttgctga 9240 
ttctaaaatc tctgtaagca aacttgctaa ggaagatgat tttagtagca gtttgtattg 9300 
ctggaatgac tgagacctag agatgcccag aaagagggct gaaggtttaa agtcagtgcc 9360 
aggaagacta aggagaggta tgactgtcat cattcaagcc tcaccctgtg gaaccacacc 9420 
ttggcctgag ccaatctgct cacagaagca gggaggtcag gaggcagggc taagcataaa 9480 
aggaagagct gggccagctg ctgcttacac ttgcttctga cacaaccgtg ttcactagca 9540 
actacacaaa cagacaccat gctgagcgct gaggagaagg ctgccgtcac ctccctattt 9600 
gccaaggtga aagtggatga agttggtggt gaggccctgg gcaggtaggt atcctgctta 9660 
caagacaggt ttgctgctgc tgctgctaag tcgcttcagt cgtgtccgac tctgtgcgac 9720 
cccatagacg gcagcccatc aggcttcacc gtccctggga ttctccaggc aagaacattg 9780 
gagtgggttg ccatttcctt ctccaatgca tgaaagtgaa aagtgaaagt gaagtcgctc 9840 
agtcatgtcc ccatggactg cagcgatccc atggactgcg ccttccaggc tcctccgtcc 9900 
atgggatttt ccaggcaaga gtaccggagt ggggtgccat tgccttctcc gacaagacag 9960 
gtttaaggag agtgaaatgc acctgggcgt gtgaggacaa agccgtccct gagattctga 10020 
aagctgctga cttcctctga ccttgtgctg ttttctcccc ctaggctgct ggttgtctac 10080 
ccctggactc agaggttctt tgagtccttt ggggacttgt cctctgccga tgccattttg 10140 
ggaaacccta aggtgaaggc ccatggcaag aaggtgctgg actccttctg tgagggcctg 10200 
aagcaacttg atgacctcaa gggtgccttt gcttcgctga gtgagctgca ctgtgataag 10260 
ctgcacgtgg atcctgagaa cttcagggtg agtttgtgga atcctcaata ttctcttttt 10320 
atggtcaagc ttgtgtcatg gggagagggc tgactggcag gacgtagttt agaatggaga 10380 
agatgtattc tggttaaagt gctaaggact cctcagaacc atttagattc ttttaatctc 10440 
tttgctcaca ataatcattt cttctgattc attcttgttc tctgttgtct gcaatgtctt 10500 
ctctttttaa ttatactctt tgagtgttca atttgaaaaa aagatttctt catctacttt 10560 
aaacataata tctaatattt tccccttatc tgttcctttc aagggataaa atgttgtatt 10620 
gcttttcgaa gtgattcaaa ataataaaaa tgataacaag ttctggatta agatagaaag 10680 
aaagaaacat ttctaaatat aaattcagcc tgctatgggt accttcatat cagcagtagc 10740 
atctatactt cagtcatctt tgtgctaata ccctaggggc acagcttggg atgaatctga 10800 
agtactctga atctaacctg ggcacctgca ctaaccctgc ccttgcttaa tgtcttttcc 10860 
acacagctcc tgggcaacgt gctggtggtt gtgctggctc gccgctttgg cagtgaattc 10920 
tccccggagc tgcaggctag ctttcagaag gtggtgactg gtgtggccaa tgccctggcc 10980 
cacagatatc actaagctcc ccttcctgat tttcaggaaa ggtcttttca tcctcagagc 11040 
ccaaaaactg aatatggaaa aattatgaag cgttttgtgc atcttgcctc tgcctaataa 11100 
agacatttat tttcattgca ctggtgtatt taaattattt cactgtctct tactcagatg 11160 
ggcacacggg agggcaaagc actgaagaca taaagaaatg aaggcctaag ttgagacttt 11220 
gagaaaatac atcagtatct tggaccccat gacaggagtg gttgtacaca gctgatgtta 11280 
ttggaaaaca ggctctgctc cttactctta ctttccttta aagaattcca ttctccccag 11340 
gttgctttaa catctacaca cggaagtgac tagaggtctg ccctctttat ctggtctctg 11400 
atggcttctc tgcctttgta gtcattagca tactgttacc agaagcaaca tatgccaatt 11460 
taatcctctc cttcaataac cgcttatccg tatacttgat tctggaaatg atgtatgaat 11520 
ttttagtaat ccttgtttcc tcttgcatcc taattctgaa tctctgttct gttcagtcca 11580 
gtggctcagt tgtgtacaat tctttgcaac cccatggact gcagcatgcc aggcttccct 11640 
gtccatcacc aactcctgga gcttgctcaa attcatgccc attgagttgg tgatgccatt 11700 
caaccatctc atcctctgtt atccccttct cttgccttca atctttccca gcatcagggt 11760 
cttttccaat gagtcagttc ttcacatcac gtgtccaaag tattgaagct tcacgttcgg 11820 
tatcagtcct tccaatgaat attcaggact aatatccttc aggtttgatt ggtttgatct 11880 
ccttgcagtc caagggactc tcaggagtct tctccaacat cacagttcaa aagcatcaat 11940 
tcttcagcgg tcacctttct tcattgtcca actctcacat ccacacatga ctactggaaa 12000 
aaccatagcc ttgactagat ggacatttgt tgataaagta gtgtctctgc tttttaatat 12060 
gctgtctaga tttgtgatag cttttcttcc aagaagcaag catctttgaa tttcatggct 12120 
gcagtcacca tcttcagtga ctttggagcc ccccaaaata aagtgtctca ctgtttccat 12180 
tgcttcccca tatatttgcc atcaaatgat ggtaccagat gccatggtct tggttttttg 12240 
aatgctgact tttacgccag ctctttcact ctcctctttc actttcacca agaggctctt 12300 
tagctcctct ttgctttctt ccataagggg ggtgaccact gcatatctga gcttattgat 12360 
atttctccag gcaatcttga ttccagcttg tgcttcatcc agcctgccat tttgcataat 12420 
gtacttggca tagaggttaa ataagcaggg tgacaataca cagccttgac gcattccttt 12480 
ccctattctg aatctagttg gggaggttct gaaagtaacg catctaacaa gtgactatct 12540 
aataacgcat ctaataaatg aatatacaag gcttttatga gatctgagga tgaagccctt 12600 
ggtatgaaat ggtggtaaag ataagtagta tagacaattg tccagaagac aagaagacaa 12660 
aagtcagaaa agaaaaggag gtcaagtcac tgatgaataa tgtgtttgtg tattctgaga 12720 
atttcaggta aacaggaaag ataactgaag caaaactgcc tgagatagaa tagaatagat 12780 
tataaatttt ggtgaacaca gtggagcgta ttcttaaagt taaaaattca cagacaagag 12840 
gatacaattc agccattact cacaagatgt ttgcacagcc aattgacaag taaagtaaga 12900 
tagtttaaca gtttagaaca tcattttgaa atcagacagc aaggattgaa aaaacattac 12960 
tacttattag ctctgacctt gtgaaaattc tgctgtcaat ttggtatata aatatgtcgt 13020 
ctttaagatt tgcaggaaaa tgaaaaaaaa taaaagtctc aacacaattc catactttga 13080 
cctttaacaa aagttaatgt caggtgagaa tttacaaaga gtaagttgac tcatttaaca 13140 
tctcttccta aatgcaaaag atctcaggcc ggggtggcaa tattgaagag tagggaattt 13200 
gggggccttt tttttttttt tggatcttaa caacaggcaa ccatgctagc agcgtctatg 13260 
tctagaagaa agttgaatta ctatgtacat ctggaatcaa caaattaaaa gtctcttatt 13320 
tttaagacaa atgttttagg gggtctaatc agagatgaaa atgtgatggt aggaagcagc 13380 
ttgagaacag tccagttgaa ataacacagg gcacaaaaaa gagacaaaga tatcaaataa 13440 
tataaaaatg ttaatcagtt ggctgatgca tagatcatca aatgccctct tatattttgc 13500 
tgttgctgag tattttcaag gacggctaac aagagcatgg atgctcattt atgttaagtc 13560 
taatgagatc atacaaatcc tgatgggaac ttctcaaaca cccagatgac caccctgcaa 13620 
cataaatttt ttttttgctg tctaagaaat ggatgtcatt tttcaatact cattcattat 13680 
aagaatatta tttatagata ccttatataa tacatagtaa gtaaaataat atgctgtgct 13740 
tagtcactca gtcatgtctg agtctttgtc cccatggact gtggcacctc tgtccatggc 13800 
gattctccag gcaagaatac tggagtggct tgccatgtcc ttctgcaggg gatcttccca 13860 
acccaggaat ggaaccgagt ctcctggata aaggtggatt ctttaccagt ggaactacca 13920 
taaatatatg taaatataca tataggtatg taaacataca tatgtataga tatgtaaaca 13980 
tacataaaga tagtacataa taaaggggaa cattcaagaa atgaatagtt atagtcattc 14040 
acataaactg tgatttgaag actattctag atttgtggca aataagctgt ctctgatgca 14100 
aatgtattta caacccctga acatctagtt ctctagacta ggcttataaa gtgttcttat 14160 
ctttcagtct tttcaaaact cttaaatact tgacctgacc tatatttact caattcagaa 14220 
caaatctgcc ttatgcattt tattcatctt tgaggcttcc ctggtggctc agatggcaaa 14280 
gaatctgcct gcaaagtggg agacctgggt tcgacccctg gaaaagatgt gggaagagcc 14340 
cctggagttg agcatggaaa tcctgtacaa tattcttgcc tggagaatcc ccatggacag 14400 
agaagctggc gggctatagt ccatgtggtc acaagagttt gacaggaccg acctactaag 14460 
cacattatgc tttgaagaaa gggaaaattt cacagcctgc tctgccccct gctggcatca 14520 
tgtgaagaat catctattta agagtgacac tatcatgaat ttttcgaggc aggtccagtg 14580 
tgggtgccat tgaataatta tggtgaagta cgaacaaaaa tctggatttt cctggaatac 14640 
agagttgttc ttattcaatg taattataaa gatactgatt tttagactta acactctccc 14700 
tctgcctttg gctttccagg tggcgcaagt ggtaaagaat ccttctgcca atgcaggaga 14760 
tgcaaggacc taggttccat ctctgggttg ggaagatccc ctggggtagg aaatggcaac 14820 
ccacttaaat attctggcct gaaaaaaacc atcgacagag gagcctggca ggctacagtc 14880 
catggggttg caaaaagtca gatacaactg agctactaac acttcttatg cctttaggtt 14940 
agtcaaggat tgttggtcat tacctattcc aaaaaaagtc atgaagtaat cacataggga 15000 
atcatttcca aaggacagaa aaaaagtacc ataggtgtct gaacacataa gtcagtttaa 15060 
actgtagaat ataagaaaag actgccctga aaaaaggttt ctgtttaatc agtcccagat 15120 
atgcttccaa atcagcattc cagacaaagt gctcagagga agaccatgat cctatcgtta 15180 
cagtggatgg aaactcacct atgcttgtcc tctccagttg tttattttat gtcagacagt 15240 
attctggttg cccaattcag gaagaagtat tctgcaccct gactttgtgg tggtcacata 15300 
acacagcatg atgtttgggg aaataatggg tctctttttt tcttcttggt acaggtgaag 15360 
catctgacat ctctgaataa tttccctgtg gaaccttcat ttatcattct cattaacact 15420 
gtactttcct ggctaccatc ctacatctca aacagaaaaa aaaaaaaaat tccaacattt 15480 
atacttttct gttgctgtaa tgttgatatt ctctttggca gtaccattaa cattcatggc 15540 
ttctctgaag aagcttgaag aagagtatgt tctttcagtg caggcctaca ttcacagaat 15600 
agttgcatct tttggcatca atcccagatc agatcagatc agatcagtcg ctcagtcgtg 15660 
tccgactctt tgcgacccca tgaattgcag cacgccaggc ctccctgtcc atcaccaact 15720 
cccggagttc actcagactc acgtccatcg agtcagtgat gccatccagc catctcatcc 15780 
tctgtcgtcc ccttctcctc ctgcccccaa tctctcccag catcagagtc ttttccaatg 15840 
agtcaactct tcgcatgagg tggccaaagt actggagttt cagctttagc atcattcctt 15900 
ccaaagaaat cccagggctg atctccttca gaatggactg gttggatctc cttgcagtcc 15960 
aagggactct caagagtctt ctccaacacc acagttcaaa agcatcaatt tttcggtgct 16020 
cagccttctt cacagtccaa ctctcacatc catacatgac cacaggaaaa aacatagccg 16080 
tgactagacg aacctttgtt ggcaaagtaa tgtctctgct tttcaatatg ctatctaggt 16140 
tggtcataac tttccttcca aggagtaagc atcttttaat ttcatggctg cagtcaccat 16200 
ctgtagtgat tttggagccc agaaaaataa agtctgccac tgtttccact gtttccccat 16260 
ctatttccca tgaagtgatg gaaccggatg ccatgatctt cattttctga atgttgagct 16320 
ttaagtcaac tttttcactc tccactttca ctttcatcaa gaggcttttg agttcctctt 16380 
cattttctgt cataagggtg gtgtcatctg catatctgag gttattgata tttgtcccgg 16440 

caatcttgat tccggcctgt gtttcttcca gtccagcgtt tctcatgatg tactctgcac 16500 

agaagttaaa taaacagggt gacaatatac agccttgacg aactcctttt cctatttgga 16560 

accagtctgt tgttccatgt ccagttctaa ctgttgcttc ctgacctgca tacaaatttc 16620 

tcaagaggca gatcaggtgg tatggtattc ccatctcttt cagaattttc cacagttgat 16680 

tgtgatccac acagtcaaag ggtttggcat agtcaataaa gcagaaatag atgcttttct 16740 

ggaactctct tgctttttcc atgatccagc agatgttggc aatttgattc tctggttcct 16800 

ctgcctttct aaatccagct tgaacatcta gaagttcacg gttcacatat tcgtgaagcc 16860 

tggcttggag aattttgagc attactttac tagcgtggga gatgagtgca attgtgcggt 16920 

agtgtgagca ttctttgcat tgcctttctt tggagttgga atgaaaactg accttttcca 16980 

gtcctgtggc cactgctgag ttttccaaat ttgctggcat attgagtgca gcactttcac 17040 

agcatcatct ttcaggattt ggaatagctc aactggaatt cgaaaactga ccttttccag 17100 

tcctgtggcc actgctgagt tttccaaatt tgctggcata t 17141 



<210> 14 
<211> 95 
<212> DNA 
<213> Ovis sp. 

<400> 14 

aattcgagaa aataatgaat gtcaaaggaa aagtgattct gtcaatgctg gttgtctcaa 60 
ctgtcattgt tgtgttttgg gaatatatcc acagg 95 



<210> 15 
<211> 2479 
<212> DNA 
<213> Ovis sp. 



<400> 15 

gagttctgct tgcctgtgag tctccattca ggtgttcaca ggctccttga aattcatgca 60
tccaagaggg aactcatcat tattttcctc caagcctatt cctcctccaa ccaatctgac 120
tgaattctac aacctacctg cttaagcaga aaccttattt cttccttcct ccctaatggc 180
taaattacca gccaaatcct atcaattctc tctctaaagt atgttttgaa tgaaatcaac 240
cattgagcca gcccatctcc ccttttttaa agaatttcat gtatttaatt ttggttgccc 300
tggctctttg ttgctaggca agggctttct ctagttgtgg tgggcagact tctcattgcg 360
atggcttctt ttgttgcgga tcacaggctc tgagcatgtg gcccttcagt ggctgtagca 420
cataggctca acagtagttg tggcttgagg gcttcagagc gcctggtatg tgggatcatc 480
ccagactagg gattgaaccc ttgtcccctg catcggcagg cagattctta accactggac 540
caccagggac gtctccaccc catcacccct gttgcaatta gaatggtcta gcttgtcgtt 600
ttatccgttc cccacaccca gtgcccagat gggtctctct gaatgaagat gggtcgggtg 660
actctccagc ttcagatgct tttgtgggtc cccaccctct ttccatgccc ttccagcctc 720
tgactgcccc tgcccgccct ctgactcctg ccatcaccac caccacccgc ctcttcactc 780
aggtgagaca gctgcactat tttttctcgt ccacgttttt gcttttgttt cccaaacaca 840
ggaagctttt ctctaatcaa cctggtatag atgctatttc tgaaatagtc tcatcccttc 900
tcatgggact ttcttttcca tcctatcttc accccatggg aagctttaaa atcatctgcc 960
cagtaagccc ttccccaacc acccaaatca aagtacttcc caagctgtag ttctctaacc 1020
agcacccctt ccccctttga tcacaatttc atgcagcaat gaagaaatat ttatgtacct 1080
actacgcacc agctgtgtgg gtatgggagg gaaaggccta cctgggaaat ggttgggtct 1140
caattgtaaa agaccagcat gctttctgct ctgaacggcg gagcacgtag ttaggatatc 1200
agaggcagca gagccattga actgatcatt ccccaaggtc acatgcagag agttagtgct 1260
atcgagtcag aggaggccag caaccatggg aaccattagc tcaaccctca ctctagaaaa 1320
atggtacaga tgaacttgtt tgcaaagtag aaatagtcac agatgtagag aacaaattta 1380
aggttatcaa gtgggggagg ggcgagatgg ggtgaattga gggattggta ttcacatact 1440
gatcctgctg tgtattaaat agataactaa tgagaacctg atgactaatg agaaccttct 1500
ggaacacaag gaactgtact cagtgctctg tggtgaccta actgggaagg cggagaaggc 1560
aatggcatcc cactccagta accttgcctg gaaaatcccg tggatggagg agcctggtag 1620
gctgcagtcc atggggtcgc taagagtcag acacgactga tcaacttcac tttcactttt 1680
ccctttcatg cattggagaa ggaaatggca acccactcca gtgttcttgc ctggagaatc 1740
ccagggacgg gggagcctgg tgggctgcca tctctggggt cgcacagagt cggacacgac 1800
tgaagtgact tagcagtagc agcagctagg aaggaaatct aaaaatgagg ggatatgtgt 1860
atatgtatag ctgatttact ttactgtata gtgaaaacta atacaacatt ttagagtaac 1920
tgtactgtaa taaaaatgaa taaaccctct cccatgtcta tggtggatct tgtgatcagt 1980
caggatactt tccctctgag cctcaaagtg gccttagcat ctttcgcaac tcaggcagcc 2040
ccatcagttc attgaattgg ctatttgccg ttgcagaccc atgcaattta aggtagaacc 2100
ttctccttgc acgtgtgcct gctcagttgc ttcagtcgtg tccaactctt tgcaacgcta 2160
tggactgttg cccaccggac acttctgtcc atggaattct ccaggcaaga gtactggagt 2220
gggttgccgt gccctccttc agggggtctt ctccatccag ggatcgaact tgcatctcct 2280
gcattgcagg cggattcttc actgctgagc ccctggggaa gcccagaaca tgtgccttag 2340
cacttgttaa atattcatca ccttttcctt ttagaaagga cataggtaga aataattatt 2400
gaaaaaaatc atatcccact cttgatatat ttaatctatt ttcccccctc ttcttttctt 2460
ttcccaggag aaaataatg 2479



<210> 16 
<211> 1096 
<212> DNA 
<213> Ovis sp. 
<400> 16 

gttttgggaa tatatccaca ggtaattatg gaacatgata aagtgatgtt aatgaacgtc 60
tccatcagcc aagtcaccag gttgaattga aattaggact tcttccttcc tgtttccctg 120
agccctagac tttaatgact ctctttaaaa tcattagagc ttcaagcact ttctaaggaa 180
ttaattatct ttcttcctgc catgttccta atatctcaat ttttatatag cttgttctgg 240
gcaggctaga gccagaaaca gtaaggtctt attaagacca aaatgtcata ttaaaatatg 300
accagatatg gaaattgcat taagaaattt cagacaggaa ttccacgaga aattcaccct 360
gatttttgca gtcctaaaat atttgcaaag ttcaaaggaa caactcaaag ttgttgactt 420
ttgctgcaaa atacactttg agtcgctggt gattcatttg tgcctggcta aacttttgag 480
tgctttgtct gttttttttt tttttttaac tctggaaaac aaaatgaatg aaatatttct 540
gagttttcaa attcatcagt ggattcacct caaatatttg agctgcttca tgtgttttga 600
gaaaatataa tgccttggag gttccagctg gagaggcttc tgacagaaag aagtctcccc 660
aggcagctac aaaaatgcac aatggttttg actttaagag gcatccgtac agctcagcct 720
ttctttcgaa aaggccacct tacacctaga ggtggatgaa cctagagcct gttatacata 780
gtgaaataag tcagaaagag agaaacaagt gttatatatt aacgctttta tatggaatct 840
ggaaagatgg tgcagatgaa actatctgca gagcagcaaa ggagatgcag actcaaagaa 900
caggcttgtg gacacagtga gggaaggaga gggtgcaatg aactgagaga acagcattga 960
aaaatatata ttgccgtatg taaatagaga gccagtggga atttgctggg tgctgcaggg 1020
agatcaaatc caatgctctg tgacaatctt agaggggtgt gatgggatgg gtggtgggag 1080
ggaggtccag aggggg 1096



<210> 17 
<211> 815 
<212> DNA 

<213> Ovis sp. 

<400> 17 

tgcagactca aagaacaggc ttgtggacac agtgagggaa ggagagggtg caatgaactg 60
agagaacagc attgaaaaat atatattgcc gtatgtaaaa tagagagcca gtgggaattt 120
gctgggtgct gcagggagat caaatccaat gctctgtgac aatttagagg ggtgtgatgg 180
gatgggtggt gggagggagg ttcaagaggg akgggacata tgtatacgta tgactgattc 240
atgtttatgt atggcagaaa ccaacacaat attgttatta tcctccaatt agaaataaat 300
tcaaataaat aaaaaaggcc accgtacaac aactctgggc caggagcact gtcttgtcac 360
tattgatatc ttcatggtct tagtaggttc ccaagttcaa aatgatggcc tggataaaca 420
ttcagttcag ttcagtcgtt cagtcgtgtc ckactctttg cgaccccatg taatcgcagc 480
acgccaggcc tccctgtcta tcaccaactc ctagagttca ctcaaactcg tgtccattga 540
gttggtgttg ccatccagcc atctcatcct ctgtcgtccc cttctcctct tgcccccaat 600
ccctcccagc ttcatgagtc ttttccatat gagtcaactt cttctcatga ggtggcctta 660
gtactggagt ttcagcttta gcatcattcc ttccaaagaa cacccagggc tgatctcctt 720
tagaatggac tggttggatc tccttgcagt ccaagggact ctcaagagtc ttctccaaca 780
ccacagttca aaagcatcga ttcttcgggg ctcag 815



<210> 18 
<211> 11034 
<212> DNA 
<213> Ovis sp. 

<400> 18 

gagttctgct tgcctgtgag tctccattca ggtgttcaca ggctccttga aattcatgca 60
tccaagaggg aactcatcat tattttcctc caagcctatt cctcctccaa ccaatctgac 120
tgaattctac aacctacctg cttaagcaga aaccttattt cttccttcct ccctaatggc 180
taaattacca gccaaatcct atcaattctc tctctaaagt atgttttgaa tgaaatcaac 240
cattgagcca gcccatctcc ccttttttaa agaatttcat gtatttaatt ttggttgccc 300
tggctctttg ttgctaggca agggctttct ctagttgtgg tgggcagact tctcattgcg 360
atggcttctt ttgttgcgga tcacaggctc tgagcatgtg gcccttcagt ggctgtagca 420
cataggctca acagtagttg tggcttgagg gcttcagagc gcctggtatg tgggatcatc 480
ccagactagg gattgaaccc ttgtcccctg catcggcagg cagattctta accactggac 540
caccagggac gtctccaccc catcacccct gttgcaatta gaatggtcta gcttgtcgtt 600
ttatccgttc cccacaccca gtgcccagat gggtctctct gaatgaagat gggtcgggtg 660
actctccagc ttcagatgct tttgtgggtc cccaccctct ttccatgccc ttccagcctc 720
tgactgcccc tgcccgccct ctgactcctg ccatcaccac caccacccgc ctcttcactc 780
aggtgagaca gctgcactat tttttctcgt ccacgttttt gcttttgttt cccaaacaca 840
ggaagctttt ctctaatcaa cctggtatag atgctatttc tgaaatagtc tcatcccttc 900
tcatgggact ttcttttcca tcctatcttc accccatggg aagctttaaa atcatctgcc 960
cagtaagccc ttccccaacc acccaaatca aagtacttcc caagctgtag ttctctaacc 1020
agcacccctt ccccctttga tcacaatttc atgcagcaat gaagaaatat ttatgtacct 1080
actacgcacc agctgtgtgg gtatgggagg gaaaggccta cctgggaaat ggttgggtct 1140
caattgtaaa agaccagcat gctttctgct ctgaacggcg gagcacgtag ttaggatatc 1200
agaggcagca gagccattga actgatcatt ccccaaggtc acatgcagag agttagtgct 1260
atcgagtcag aggaggccag caaccatggg aaccattagc tcaaccctca ctctagaaaa 1320
atggtacaga tgaacttgtt tgcaaagtag aaatagtcac agatgtagag aacaaattta 1380
aggttatcaa gtgggggagg ggcgagatgg ggtgaattga gggattggta ttcacatact 1440
gatcctgctg tgtattaaat agataactaa tgagaacctg atgactaatg agaaccttct 1500
ggaacacaag gaactgtact cagtgctctg tggtgaccta actgggaagg cggagaaggc 1560
aatggcatcc cactccagta accttgcctg gaaaatcccg tggatggagg agcctggtag 1620 
gctgcagtcc atggggtcgc taagagtcag acacgactga tcaacttcac tttcactttt 1680 
ccctttcatg cattggagaa ggaaatggca acccactcca gtgttcttgc ctggagaatc 1740 
ccagggacgg gggagcctgg tgggctgcca tctctggggt cgcacagagt cggacacgac 1800 
tgaagtgact tagcagtagc agcagctagg aaggaaatct aaaaatgagg ggatatgtgt 1860 
atatgtatag ctgatttact ttactgtata gtgaaaacta atacaacatt ttagagtaac 1920 
tgtactgtaa taaaaatgaa taaaccctct cccatgtcta tggtggatct tgtgatcagt 1980 
caggatactt tccctctgag cctcaaagtg gccttagcat ctttcgcaac tcaggcagcc 2040 
ccatcagttc attgaattgg ctatttgccg ttgcagaccc atgcaattta aggtagaacc 2100 
ttctccttgc acgtgtgcct gctcagttgc ttcagtcgtg tccaactctt tgcaacgcta 2160 
tggactgttg cccaccggac acttctgtcc atggaattct ccaggcaaga gtactggagt 2220 
gggttgccgt gccctccttc agggggtctt ctccatccag ggatcgaact tgcatctcct 2280 
gcattgcagg cggattcttc actgctgagc ccctggggaa gcccagaaca tgtgccttag 2340 
cacttgttaa atattcatca ccttttcctt ttagaaagga cataggtaga aataattatt 2400 
gaaaaaaatc atatcccact cttgatatat ttaatctatt ttcccccctc ttcttttctt 2460 
ttcccaggag aaaataatgn trnnbndary startcdnnd rndgrstnsn cndcatdasn 2520 
grsdnntrnb ndarygtttt gggaatatat ccacaggtaa ttatggaaca tgataaagtg 2580 
atgttaatga acgtctccat cagccaagtc accaggttga attgaaatta ggacttcttc 2640 
cttcctgttt ccctgagccc tagactttaa tgactctctt taaaatcatt agagcttcaa 2700 
gcactttcta aggaattaat tatctttctt cctgccatgt tcctaatatc tcaattttta 2760 
tatagcttgt tctgggcagg ctagagccag aaacagtaag gtcttattaa gaccaaaatg 2820 
tcatattaaa atatgaccag atatggaaat tgcattaaga aatttcagac aggaattcca 2880 
cgagaaattc accctgattt ttgcagtcct aaaatatttg caaagttcaa aggaacaact 2940 
caaagttgtt gacttttgct gcaaaataca ctttgagtcg ctggtgattc atttgtgcct 3000 
ggctaaactt ttgagtgctt tgtctgtttt tttttttttt ttaactctgg aaaacaaaat 3060 
gaatgaaata tttctgagtt ttcaaattca tcagtggatt cacctcaaat atttgagctg 3120 
cttcatgtgt tttgagaaaa tataatgcct tggaggttcc agctggagag gcttctgaca 3180 
gaaagaagtc tccccaggca gctacaaaaa tgcacaatgg ttttgacttt aagaggcatc 3240 
cgtacagctc agcctttctt tcgaaaaggc caccttacac ctagaggtgg atgaacctag 3300 
agcctgttat acatagtgaa ataagtcaga aagagagaaa caagtgttat atattaacgc 3360 
ttttatatgg aatctggaaa gatggtgcag atgaaactat ctgcagagca gcaaaggaga 3420 
tgcagactca aagaacaggc ttgtggacac agtgagggaa ggagagggtg caatgaactg 3480 
agagaacagc attgaaaaat atatattgcc gtatgtaaat agagagccag tgggaatttg 3540 
ctgggtgctg cagggagatc aaatccaatg ctctgtgaca atcttagagg ggtgtgatgg 3600 
gatgggtggt gggagggagg tccagagggg ggrstnsncn dcatdasngr sdtgcagact 3660 
caaagaacag gcttgtggac acagtgaggg aaggagaggg tgcaatgaac tgagagaaca 3720 
gcattgaaaa atatatattg ccgtatgtaa aatagagagc cagtgggaat ttgctgggtg 3780 
ctgcagggag atcaaatcca atgctctgtg acaatttaga ggggtgtgat gggatgggtg 3840 
gtgggaggga ggttcaagag ggakgggaca tatgtatacg tatgactgat tcatgtttat 3900 
gtatggcaga aaccaacaca atattgttat tatcctccaa ttagaaataa attcaaataa 3960 
ataaaaaagg ccaccgtaca acaactctgg gccaggagca ctgtcttgtc actattgata 4020 
tcttcatggt cttagtaggt tcccaagttc aaaatgatgg cctggataaa cattcagttc 4080 
agttcagtcg ttcagtcgtg tcckactctt tgcgacccca tgtaatcgca gcacgccagg 4140 
cctccctgtc tatcaccaac tcctagagtt cactcaaact cgtgtccatt gagttggtgt 4200 
tgccatccag ccatctcatc ctctgtcgtc cccttctcct cttgccccca atccctccca 4260 
gcttcatgag tcttttccat atgagtcaac ttcttctcat gaggtggcct tagtactgga 4320 
gtttcagctt tagcatcatt ccttccaaag aacacccagg gctgatctcc tttagaatgg 4380 
actggttgga tctccttgca gtccaaggga ctctcaagag tcttctccaa caccacagtt 4440 
caaaagcatc gattcttcgg ggctcaggrs tnsncndcat dasngrsdaa gagtcttctc 4500 
caacaccaca gttcaaaagc atcgattctt cggggctcag ccttcttcat agtccaactc 4560 
tcacatccat atatgaccac tggaaaaacc atagccttga catagatgga cctttgttgg 4620 
caaagtaatg tctctgcttt tgaatatgct ataaacatta gaatgttgct ttcagttctt 4680 
aaattctaag atttagatgg gctggtagga ctgtcctggc tgcttataaa ggatctgtac 4740 
cttcaggtac atcagtggaa ggaggaatct ctggactctg gagctgtttt tccccagggt 4800 
gacctaacca gttctggagt attattgttt cattctgaaa aacttgtttt aaaagaaaca 4860 
ggacaagcta gagagtgacc atgttaaagg ggtctacaaa ccaggtcttt caaattgaag 4920 
gaaaccatgg gtacttagtc tgaagaaaag gtgactcagt gttggatgca gggaccctgg 4980 
ggctcccttt tgctgagggc ggaggaagta tcctggctgg agtctggaac taggctgcct 5040 
ggggctggtc ctgtgccatg acttgcctcc tctgtgactt tgggcaagtt acttcaccac 5100 
tctgttcctt ggttttcaat aagtttccct gttgtttaaa atgggcataa tggtagtacc 5160 
tgctttgtag ggttgttgga ttaaataagt tattgtgtat agagcactta gggcagtgcc 5220 
ctcagaaaac gttcgctgcc atcaccatca tcaccaccat tctcatcatc tgctcagagg 5280 
taagatttgg caaatagaag gaagagctcc agcagttgga agaatcctac agaagagtga 5340 
gcgtcctttg gggtcttgcc ctgagctgag ctacttgtgt aggtaacctt gagagcattt 5400 
gagctctgaa tgcttgattg gataaccttg gcgttttgaa ttgcagccag agtcacactt 5460 
ccaaattttg ggtacttcac aaaattaaaa tatgggagcc aaaggcccaa atgtgcatat 5520 
tggcactggg ctcccatagr ntrncsncrm arkbtkbwth rscttnndca tdasngrsdt 5580 
cggccgctct agaactagtg gatcccccgg gctgcaggaa ttccaaaaat gccagcnatt 5640 
agatagcccg aacgtgtgtc attgtttctc ccacacactc aagaccccaa ctgagaacta 5700 
aggaccgaag tgactaagcg cggagcttcc ccacgtcagc ctggctgacc actctccccg 5760 
ggcttccgca gctgcctctg cccttctcct cctagaggct ttggcccccg gagatctggc 5820 
caaccagaac ttcttcttcc agaggttgga tgggttcctg tttctgcttc catttccttg 5880 
tttagaatga catttctccc caaactgcaa tcattcttac ttaaaagtcg gagctgtgat 5940 
aatagtttaa aatataaccc attttgttcc tttggtgatt tttttttttc tttatcaaag 6000 
cgaaagcaaa ggtttacttt tcctattagc ctcctccctg caacccccct ccacgggcct 6060 
tcccatctgt atcttttctg agcgccacat gggaagttga ctgctcgctt gtacagagag 6120 
tggtgaaggc ccgggtggat ctgagtgagg atctgtgcat cttcactccc aagaccgtcc 6180 
ccagcccaga gcccagaagc catcgccctc accctactgg tcaagtcctg atgtcaccac 6240 
ctgtgctgat cccaggggca tctgtgatgg agaagggtga ctctggaatt ttgacatttt 6300 
tactggaaga atctgtactg ttatgcgtct ggagcagatg cctagctcac ctcatccttt 6360 
ggtatgagag gggatgagcc agccaccctg tagactgatt tactcctgat gttccacaaa 6420 
gaacagaatg tggcctgcgg ttccacgatg gcttcaccct cgtattcgca gcttcacaga 6480 
accaaacagt aaattgtaca ccctgagctc tcctcttgga ggccggaggg gactgcttgt 6540 
actctgcagc aataaatcct cccaagagtg gatctggact ttaggaggtg tgaagtttgg 6600 
tttgttttca aataacagtg cccggagcca attgcaatgc tttcatctct aaataatgtt 6660 
aaacctcaca agggccccct cccctccccc taggattgtt tcccaagctt cagggcacaa 6720 
tagatttttt tttcagagcc atggcaaagt ccctcttcat attgctcaga tatattctac 6780 
tgtaaacccc ggggtttttc cagagaccaa tattaaaacc acagtttatt tttgtagaaa 6840 
tgaacaggaa tccaaactgc aactaaggag caaaaacaaa tgccagttat tattactcac 6900 
aaaatgtcat aactggccgt gagtgtgatc agtaaattaa acttagtcca catccgctga 6960 
gcatgagtct gggggtgatg caagttgtgt taagccaact cccgctttcc ttctctatca 7020 
tccccacgct gtcacgtgat aagtgtgtag tttggctcac aaaagacata aacatgtttt 7080 
cggaatgaat aagtaaatag ctaaaatacc acatctatta gaaggaccta agtccaaaac 7140 
actgacaaca caaactgctg gcaaaaatat ggagcaagca gaactctgat tcactgctgg 7200 
cgagaaggca agatggttca gctgctttgg aagaaagctg aacatattct tatcatacag 7260 
tctagcaatt gcatccttgg tattcaccaa aaaaagctga aactttacag ccgcacagaa 7320 
acttacatac agatgtttaa acgcagcttt agtcacagtt tctaaaactt ggacataact 7380 
aagatgtcct ttagtaggtg aatggattaa taaactgttg tggcacgtca ggtggtggaa 7440 
tatattatca gtgcaaaaac acaaatctgt caagccatgg aagagataga ggaacttaat 7500 
gcatattatt aagtaaagaa accatctgaa agattgcctg ctgtcagatt ccagrntrnc 7560 
sncrmarkbt kbwthrsctt nndcatdasn grsdtaggga acaaaagctg ggtacgatat 7620 
catacatagt gaaaaatact tgtcattaca tatttgtcga aacccataga atacacacaa 7680 
ccaagagtga acttttttta gaaaaggaaa tgagatagtt aaaataagtt taaatcagag 7740 
ggtgtttgct ctggtatttg gatgggtctt ataatggtgg tagctggata atggagtggt 7800 
gtttggggta accactagaa aattcttaaa gggaatgtat aatgtggaag gatattcatg 7860 
tcccccataa cccattctct gggtcccaga gaagaagaga gtggtgatgc ttagtcttaa 7920 
ggcatcccca aaggacctaa gaggcttgtc accctttcat ctgatagggc aggataacat 7980 
tggagcggat gttggggttt ctgagagatg cctgagaact agcgccccct gatgttgatc 8040 
acggggtaca accttccagt tataaaatga gcaagttctg gggatctgat gtaaagcatt 8100 
gtggttattg ttaataatat tgttttctat actcataaag ttattcagat aataaatctc 8160 
aaatcttctc cccaaaacag aaatgatgat tatgtgatgc aatgaaggtg ttaactaatg 8220 
ctactgctgc tgctgagtcg cttcagtcgt gtgcgactct gtgcgacccc atagacggca 8280 
gcccaccagg ctcccctgtc cctgggattc tccaggcaag aacactggag tgggttgcca 8340 
tttccttctc caatgcatga aggtgaaaag agaaagtgaa gtcactcagt cgtgtctgac 8400 
tcttagcgac cccatggact gcagcctacc agccttctcc gtccatggga ttttccaggc 8460 
aagagtactg gagtggggtg ccattgcctt ctccgaacta atgctacagt ggtgggrntr 8520 
ncsncrmarm mdatytnndc atdasngrsd tcctcctcca ggggctcttc ctgaccccag 8580 
gcgtcaaacc cgcatctcct gcgtctcttg catcacaggc agattctcta cccactgagc 8640 
catctgggaa gcccttttat aatatatgaa tataccaaat caacacttgg cacattttaa 8700 
acttaaacta tgctatatgt caattatatc tcaataaaac aggaccaaaa aaggactagc 8760 
acactttcac ctacccggag tctgaaccaa ctgataaata agacaaaggg ataaataagg 8820 
tagatggtag tgatgatcat cttcactgtg ttaaaagctg atattgaaga atatgggtta 8880 
taaaactgaa ggccctgggc ctcctgatgt tgaaccacag aattagaaag acgtaggcac 8940 
ccttcatcca tagcatgaga aactgcaggt gtctggatgt tggagaaggg cctgtatcat 9000 
ttagctacat tgcagggccc tagatagata gagatctgtt taaacctcga gggccggagg 9060 
gtcgagcagt gtggttttca agaggaagca aaaagcctct ccacccaggc ctggaatgtt 9120 
tccacccaat gtcgagcagt gtggttttgc aagaggaagc aaaaagcctc tccacccagg 9180 
cctggaatgt ttccacccaa tgrntrncsn crmarmmdat ytnndcatda sngrsdtctt 9240 
ctatcgcctt cttgacgagt tcttctgagg ggatcggcaa taaaaagaca gaataaaacg 9300 
cacgggtgtt gggtcgtttg ttcggatccg tcgacaggcc tacctgcgga tattaagctt 9360 
gtcaagatgt cttggcagac aaaagagtat aatgtggtta gaaataacgt ctgactttgt 9420 
gccagtacat ttttgaattt gagagagtat tattctggct acttcctcag aaaagtaaca 9480 
cttaatttta acttaaaaaa aaaaaaaaac taacaaaaga ccaacacagc aagtacacat 9540 
tatttctcct tgtaacattg agccttgtaa tacgggagaa tgaacctgtg gtaatcggat 9600 
gtaaattccc agtgatttct tacctattct cagttgcggg ggaggggaat ggatacacta 9660 
tcagttgaac caaaaaagag aaaaaattat catagacaaa agccagaaac attctacatg 9720 
tgccaattaa cttactggag agaagtacgc aaagggacgt gtttggcaac aagacaatga 9780 
ttgtgagggg ctgtcctctt gatttcagtg tcttcctgtc tctgctgggt ctgaagcaac 9840 
acagagttgc ttttgcagca ggagaactct tagtaggaca tagtctacct tggcgatcct 9900 
caaatggttt agtcagtgtc ctttccttat caggatattg gcttctgtgc caagactagt 9960 
caagattccc agtacatatg gtgaagatgt tggggaatca caggcaatct gagtcagtgg 10020 
gtgctcagca caaaatgaaa acaaagcctc caagtaagtt ggccatgaaa taagcaaaga 10080 
gagagtagaa aggrntrncs ncrmarkbtk bwthrscttn ndcatdasng rsdttactca 10140 
caagtgtgaa acaccataga aggcatgaat acataataaa tgatgcatac cttttaaaga 10200 
gagaaagcta gggtaagtgg ggatctactg ttcaagtctc agggattcaa acccagcact 10260 
gtctgactct aaatcctttg ttctttccac agctcaagtc tttctctttc aaacacacac 10320 
acacacacac acacacacac acactttaac cattttgttt ttgagaccga gggcatcaag 10380 
gaatcaccac caagaactct caattcttgt actagcatta aagaaagaga acttagttga 10440 
ctcaaacttc acagatggat ggtaggtcta aactcatcct ggagcctggc tggaaagcag 10500 
tcaaacagaa cctgagcgat acgaggctca gggaagttca agggctgaga gggtacagag 10560 
gagagaaggt ctgggaggtg gaaacagtct cagaggggct gcttctgaat taagcactga 10620 
gagaagagaa agggatcact aagaagtgga aatggaatga ggtttcggag agaggaacag 10680 
caggtacgaa gcatggagga ctattctgag gcttttaggg agaccctgga tgaaatctca 10740 
taggagagcc tgtgtgaaaa gcagcaaaga gcttgaaagg gctttggctg gttgggagct 10800 
gcacatttgc ccagcatctg tgtctttctt tctcgcttta tctccttcat agagtctatg 10860 
tggtttccat ggcaactaat gccatagttg ggccaagtga cccagactaa gctaatcaac 10920 
attttccctc cccccgcccc caggggggca gatgatctaa actagtccaa tcaagttgaa 10980 
tctcaggcct ttaattaagc ggccgccacc gcggtggagc tccattcgcc ctat 11034 

<210> 19 
<211> 1813 
<212> DNA 
<213> Ovis sp. 

<400> 19 

tgcagactca aagaacaggc ttgtggacac agtgagggaa ggagagggtg caatgaactg 60 

agagaacagc attgaaaaat atatattgcc gtatgtaaaa tagagagcca gtgggaattt 120 

gctgggtgct gcagggagat caaatccaat gctctgtgac aatttagagg ggtgtgatgg 180 

gatgggtggt gggagggagg ttcaagaggg akgggacata tgtatacgta tgactgattc 240 

atgtttatgt atggcagaaa ccaacacaat attgttatta tcctccaatt agaaataaat 300 

tcaaataaat aaaaaaggcc accgtacaac aactctgggc caggagcact gtcttgtcac 360 

tattgatatc ttcatggtct tagtaggttc ccaagttcaa aatgatggcc tggataaaca 420 

ttcagttcag ttcagtcgtt cagtcgtgtc ckactctttg cgaccccatg taatcgcagc 480 

acgccaggcc tccctgtcta tcaccaactc ctagagttca ctcaaactcg tgtccattga 540 

gttggtgttg ccatccagcc atctcatcct ctgtcgtccc cttctcctct tgcccccaat 600 

ccctcccagc ttcatgagtc ttttccatat gagtcaactt cttctcatga ggtggcctta 660 

gtactggagt ttcagcttta gcatcattcc ttccaaagaa cacccagggc tgatctcctt 720 

tagaatggac tggttggatc tccttgcagt ccaagggact ctcaagagtc ttctccaaca 780 

ccacagttca aaagcatcga ttcttcgggg ctcagccttc ttcatagtcc aactctcaca 840 

tccatatatg accactggaa aaaccatagc cttgacatag atggaccttt gttggcaaag 900 

taatgtctct gcttttgaat atgctataaa cattagaatg ttgctttcag ttcttaaatt 960 

ctaagattta gatgggctgg taggactgtc ctggctgctt ataaaggatc tgtaccttca 1020 

ggtacatcag tggaaggagg aatctctgga ctctggagct gtttttcccc agggtgacct 1080 

aaccagttct ggagtattat tgtttcattc tgaaaaactt gttttaaaag aaacaggaca 1140 

agctagagag tgaccatgtt aaaggggtct acaaaccagg tctttcaaat tgaaggaaac 1200 

catgggtact tagtctgaag aaaaggtgac tcagtgttgg atgcagggac cctggggctc 1260 

ccttttgctg agggcggagg aagtatcctg gctggagtct ggaactaggc tgcctggggc 1320 

tggtcctgtg ccatgacttg cctcctctgt gactttgggc aagttacttc accactctgt 1380 

tccttggttt tcaataagtt tccctgttgt ttaaaatggg cataatggta gtacctgctt 1440 

tgtagggttg ttggattaaa taagttattg tgtatagagc acttagggca gtgccctcag 1500 

aaaacgttcg ctgccatcac catcatcacc accattctca tcatctgctc agaggtaaga 1560 

tttggcaaat agaaggaaga gctccagcag ttggaagaat cctacagaag agtgagcgtc 1620 

ctttggggtc ttgccctgag ctgagctact tgtgtaggta accttgagag catttgagct 1680 

ctgaatgctt gattggataa ccttggcgtt ttgaattgca gccagagtca cacttccaaa 1740 

ttttgggtac ttcacaaaat taaaatatgg gagccaaagg cccaaatgtg catattggca 1800 

ctgggctccc ata 1813 

<210> 20 

<211> 1379 

<212> DNA 

<213> Ovis sp. 

<400> 20 

actggcacac ccgggggtgc aggtagagct gggctgtggg tttgagacga gacccggggc 60 
accgggagtg ggtgggaggg cctggttggt tctagaacag gaggaccgca gagcttggtg 120 
gcggaaggac aggaggtgtg gacgagtgtc tcttgggggc tggggtgagg aggaggtgac 180 
ctggccttgc ccagcccccc tggaaaaact tagcttcacc cctggcagag gggcagtgtt 240 
gtgaaaaccc actctccacc catgtgtgag ggagagagga agttctcagt gggctgatct 300 
atttgccaat atctctttga tagcctcacc aaggccactt gcacacagca ggggcccgat 360 
ggggccacca agactttgcc ccagtcactg ggcctctccc cgcagtggct gaagtgcagg 420 
cacatggcgt ggcccttgtg tgacaggggt tgctctaaat gactgtgctt cagtctccac 480 
ttcccgccag ggaagggcag tctcttgcaa acccaaatgg ggttggagct gagtccccac 540 
cccattcccg ccttagcctg ggccccagcc gctgctcagc ctttctgcat ttactcttta 600 
tggttggttg gtttgtcttc gctcttacat tgcttcctgt gtttgtattg gctctgccaa 660 
gtccctttca agtgtgctga gtgcacagaa cacccccctc ccccaccccc ggctttgtaa 720 
ggaggaaacg tgatctgaga gaggtgacag ccagggtccc tgtgtttcat ccctggctac 780 
tgtgagcctg caggatcctt tcttagtgac tgtctgctcc ttgtatgtca aatgaacaat 840 
aacaccagaa accccttagt cctataccca ggtaacaacc aaaaattcct ntctggcaaa 900 
aggcagattg aaacagcttc ttcctctttc agctctaatg tcactgtctt gtccccctgc 960 
gtcaagcatc agggctttct ctgggtcaga tggcacccat tgggataaat aaggtctaaa 1020 
gatgccgctg gcagcttagg aaggggctgg ggagatggga gtgggatatt gaaggaagat 1080 
gaggataggg gaggaaggaa agggggacag agggcatgag gtataaaggt aagacccccn 1140 
tagggccccc gctaaagcag cccccaggca cctgggccac cgtctcctgt tcctcaccca 1200 
gaagatgccg cagatgtctc actgccagct gactgttggg gttaaactcc cattaggatt 1260 
caaggatggt taccaccagc cacggctgag ccgttttcca tcagatgtct ctgtcccacc 1320 
ttagcaaggc catcatgact catggactct gttttgtcac cagtaaacgc cctgaggtt 1379 

<210> 21 

<211> 1973 

<212> DNA 

<213> Ovis sp. 
<400> 21 

cggccgctct agaactagtg gatcccccgg gctgcaggaa ttccaaaaat gccagcnatt 60 
agatagcccg aacgtgtgtc attgtttctc ccacacactc aagaccccaa ctgagaacta 120 
aggaccgaag tgactaagcg cggagcttcc ccacgtcagc ctggctgacc actctccccg 180 
ggcttccgca gctgcctctg cccttctcct cctagaggct ttggcccccg gagatctggc 240 
caaccagaac ttcttcttcc agaggttgga tgggttcctg tttctgcttc catttccttg 300 
tttagaatga catttctccc caaactgcaa tcattcttac ttaaaagtcg gagctgtgat 360 
aatagtttaa aatataaccc attttgttcc tttggtgatt tttttttttc tttatcaaag 420 
cgaaagcaaa ggtttacttt tcctattagc ctcctccctg caacccccct ccacgggcct 480 
tcccatctgt atcttttctg agcgccacat gggaagttga ctgctcgctt gtacagagag 540 
tggtgaaggc ccgggtggat ctgagtgagg atctgtgcat cttcactccc aagaccgtcc 600 
ccagcccaga gcccagaagc catcgccctc accctactgg tcaagtcctg atgtcaccac 660 
ctgtgctgat cccaggggca tctgtgatgg agaagggtga ctctggaatt ttgacatttt 720 
tactggaaga atctgtactg ttatgcgtct ggagcagatg cctagctcac ctcatccttt 780 
ggtatgagag gggatgagcc agccaccctg tagactgatt tactcctgat gttccacaaa 840 
gaacagaatg tggcctgcgg ttccacgatg gcttcaccct cgtattcgca gcttcacaga 900 
accaaacagt aaattgtaca ccctgagctc tcctcttgga ggccggaggg gactgcttgt 960 
actctgcagc aataaatcct cccaagagtg gatctggact ttaggaggtg tgaagtttgg 1020 
tttgttttca aataacagtg cccggagcca attgcaatgc tttcatctct aaataatgtt 1080 
aaacctcaca agggccccct cccctccccc taggattgtt tcccaagctt cagggcacaa 1140 
tagatttttt tttcagagcc atggcaaagt ccctcttcat attgctcaga tatattctac 1200 
tgtaaacccc ggggtttttc cagagaccaa tattaaaacc acagtttatt tttgtagaaa 1260 
tgaacaggaa tccaaactgc aactaaggag caaaaacaaa tgccagttat tattactcac 1320 
aaaatgtcat aactggccgt gagtgtgatc agtaaattaa acttagtcca catccgctga 1380 
gcatgagtct gggggtgatg caagttgtgt taagccaact cccgctttcc ttctctatca 1440 
tccccacgct gtcacgtgat aagtgtgtag tttggctcac aaaagacata aacatgtttt 1500 
cggaatgaat aagtaaatag ctaaaatacc acatctatta gaaggaccta agtccaaaac 1560 
actgacaaca caaactgctg gcaaaaatat ggagcaagca gaactctgat tcactgctgg 1620 
cgagaaggca agatggttca gctgctttgg aagaaagctg aacatattct tatcatacag 1680 
tctagcaatt gcatccttgg tattcaccaa aaaaagctga aactttacag ccgcacagaa 1740 
acttacatac agatgtttaa acgcagcttt agtcacagtt tctaaaactt ggacataact 1800 
aagatgtcct ttagtaggtg aatggattaa taaactgttg tggcacgtca ggtggtggaa 1860 
tatattatca gtgcaaaaac acaaatctgt caagccatgg aagagataga ggaacttaat 1920 
gcatattatt aagtaaagaa accatctgaa agattgcctg ctgtcagatt cca 1973 



<210> 22 
<211> 920 
<212> DNA 
<213> Ovis sp. 

<400> 22 

agggaacaaa agctgggtac gatatcatac atagtgaaaa atacttgtca ttacatattt 60 
gtcgaaaccc atagaataca cacaaccaag agtgaacttt ttttagaaaa ggaaatgaga 120 
tagttaaaat aagtttaaat cagagggtgt ttgctctggt atttggatgg gtcttataat 180 
ggtggtagct ggataatgga gtggtgtttg gggtaaccac tagaaaattc ttaaagggaa 240 
tgtataatgt ggaaggatat tcatgtcccc cataacccat tctctgggtc ccagagaaga 300 
agagagtggt gatgcttagt cttaaggcat ccccaaagga cctaagaggc ttgtcaccct 360 
ttcatctgat agggcaggat aacattggag cggatgttgg ggtttctgag agatgcctga 420 
gaactagcgc cccctgatgt tgatcacggg gtacaacctt ccagttataa aatgagcaag 480 
ttctggggat ctgatgtaaa gcattgtggt tattgttaat aatattgttt tctatactca 540 
taaagttatt cagataataa atctcaaatc ttctccccaa aacagaaatg atgattatgt 600 
gatgcaatga aggtgttaac taatgctact gctgctgctg agtcgcttca gtcgtgtgcg 660 
actctgtgcg accccataga cggcagccca ccaggctccc ctgtccctgg gattctccag 720 
gcaagaacac tggagtgggt tgccatttcc ttctccaatg catgaaggtg aaaagagaaa 780 
gtgaagtcac tcagtcgtgt ctgactctta gcgaccccat ggactgcagc ctaccagcct 840 
tctccgtcca tgggattttc caggcaagag tactggagtg gggtgccatt gccttctccg 900 
aactaatgct acagtggtgg 920 



<210> 23 
<211> 650 
<212> DNA 

<213> Ovis sp. 

<400> 23 

cctcctccag gggctcttcc tgaccccagg cgtcaaaccc gcatctcctg cgtctcttgc 60 
atcacaggca gattctctac ccactgagcc atctgggaag cccttttata atatatgaat 120 
ataccaaatc aacacttggc acattttaaa cttaaactat gctatatgtc aattatatct 180 
caataaaaca ggaccaaaaa aggactagca cactttcacc tacccggagt ctgaaccaac 240 
tgataaataa gacaaaggga taaataaggt agatggtagt gatgatcatc ttcactgtgt 300 
taaaagctga tattgaagaa tatgggttat aaaactgaag gccctgggcc tcctgatgtt 360 
gaaccacaga attagaaaga cgtaggcacc cttcatccat agcatgagaa actgcaggtg 420 
tctggatgtt ggagaagggc ctgtatcatt tagctacatt gcagggccct agatagatag 480 
agatctgttt aaacctcgag ggccggaggg tcgagcagtg tggttttcaa gaggaagcaa 540 
aaagcctctc cacccaggcc tggaatgttt ccacccaatg tcgagcagtg tggttttgca 600 
agaggaagca aaaagcctct ccacccaggc ctggaatgtt tccacccaat 650 
<210> 24 
<211> 705 
<212> DNA 
<213> Ovis sp. 



<400> 24 

tataatgtgg ttagaaataa cgtctgactt tgtgccagta catttttgaa tttgagagag 60 
tattattctg gctacttcct cagaaaagta acacttaatt ttaacttaaa aaaaaaaaaa 120 
aactaacaaa agaccaacac agcaagtaca cattatttct ccttgtaaca ttgagccttg 180 
taatacggga gaatgaacct gtggtaatcg gatgtaaatt cccagtgatt tcttacctat 240 
tctcagttgc gggggagggg aatggataca ctatcagttg aaccaaaaaa gagaaaaaat 300 
tatcatagac aaaagccaga aacattctac atgtgccaat taacttactg gagagaagta 360 
cgcaaaggga cgtgtttggc aacaagacaa tgattgtgag gggctgtcct cttgatttca 420 
gtgtcttcct gtctctgctg ggtctgaagc aacacagagt tgcttttgca gcaggagaac 480 
tcttagtagg acatagtcta ccttggcgat cctcaaatgg tttagtcagt gtcctttcct 540 
tatcaggata ttggcttctg tgccaagact agtcaagatt cccagtacat atggtgaaga 600 
tgttggggaa tcacaggcaa tctgagtcag tgggtgctca gcacaaaatg aaaacaaagc 660 
ctccaagtaa gttggccatg aaataagcaa agagagagta gaaag 705 



<210> 25 
<211> 900 
<212> DNA 
<213> Ovis sp. 



<400> 25 

tactcacaag tgtgaaacac catagaaggc atgaatacat aataaatgat gcataccttt 60 
taaagagaga aagctagggt aagtggggat ctactgttca agtctcaggg attcaaaccc 120 
agcactgtct gactctaaat cctttgttct ttccacagct caagtctttc tctttcaaac 180 
acacacacac acacacacac acacacacac tttaaccatt ttgtttttga gaccgagggc 240 
atcaaggaat caccaccaag aactctcaat tcttgtacta gcattaaaga aagagaactt 300 
agttgactca aacttcacag atggatggta ggtctaaact catcctggag cctggctgga 360 
aagcagtcaa acagaacctg agcgatacga ggctcaggga agttcaaggg ctgagagggt 420 
acagaggaga gaaggtctgg gaggtggaaa cagtctcaga ggggctgctt ctgaattaag 480 
cactgagaga agagaaaggg atcactaaga agtggaaatg gaatgaggtt tcggagagag 540 
gaacagcagg tacgaagcat ggaggactat tctgaggctt ttagggagac cctggatgaa 600 
atctcatagg agagcctgtg tgaaaagcag caaagagctt gaaagggctt tggctggttg 660 
ggagctgcac atttgcccag catctgtgtc tttctttctc gctttatctc cttcatagag 720 
tctatgtggt ttccatggca actaatgcca tagttgggcc aagtgaccca gactaagcta 780 
atcaacattt tccctccccc cgcccccagg ggggcagatg atctaaacta gtccaatcaa 840 
gttgaatctc aggcctttaa ttaagcggcc gccaccgcgg tggagctcca ttcgccctat 900 



<210> 26 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Probes and PCR 
Primers 

<400> 26 

gagaaaataa tgaatgtcaa agga 24 



<210> 27 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Probes and PCR 
Primers 

<400> 27 

tgataatccc agcagtattc 



<210> 28 

<211> 22 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Probes and PCR 
Primers 

<400> 28 

cttgatgggt ttatccagaa ca 22 



<210> 29 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Probes and PCR 
Primers 

<400> 29 

ctgtggatat attcccaaaa cac 23 



<210> 30 
<211> 95 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Probes and PCR 
Primers 

<400> 30 

aattcgagaa aataatgaat gtcaaaggaa aagtgattct gtcaatgctg gttgtctcaa 60 
ctgtcattgt tgtgttttgg gaatatatcc acagg 95 



<210> 31 
<211> 95 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Probes and PCR 
Primers 

<400> 31 

aattcctgtg gatatattcc caaaacacaa caatgacagt tgagacaacc agcattgaca 60 

gaatcacttt tcctttgaca ttcattattt tctcg 95 



<210> 32 
<211> 24 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Probes and PCR 
Primers 

<400> 32 

ggtggtttcc gagatggttt aaca 24 



<210> 33 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Probes and PCR 
Primers 

<400> 33 

gggttgaacc agtccgatag ctta 24 



<210> 34 

<211> 21 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Probes and PCR 
Primers 

<400> 34 

tccaggatgc ctttgataga g 21 



<210> 35 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Probes and PCR 
Primers 

<400> 35 

gggaggaagc gaaggtgca 19 



<210> 36 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Probes and PCR 
Primers 

<400> 36 

acgtggctcc aagaattctc caggcaagag tactgg 36 



<210> 37 
<211> 49 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Probes and PCR 
Primers 

<400> 37 

catcttgttc aatggccgat cccattattt tctcctggga aaagaaaag 49 



<210> 38 
<211> 49 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Probes and PCR 
Primers 

<400> 38 

cttttctttt cccaggagaa aataatggga tcggccattg aacaagatg 49 



<210> 39 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Probes and PCR 
Primers 

<400> 39 

caggtcgacg gatccgaaca aac 23 



<210> 40 
<211> 47 
<212> DNA 



<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Probes and PCR 
Primers 

<400> 40 

cagatctaac gaggattcaa tgtcaaagga aaagtgattc tgtcaat 47 



<210> 41 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Probes and PCR 
Primers 

<400> 41 

ctgaactgaa tgtttatcca ggccatc 27 



<210> 42 
<211> 51 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Probes and PCR 
Primers 

<400> 42 

gcgcaccgtg ggcttgtact cggtcattat tttctcctgg gaaaagaaaa g 51 



<210> 43 
<211> 34 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Probes and PCR 
Primers 

<400> 43 

gagaaaataa tgaccgagta caagcccacg gtgc 



<210> 44 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Probes and PCR 
Primers 

<400> 44 

ctggggatcc agacatgata agatac 



<210> 45 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Probes and PCR 
Primers 

<400> 45 

ctggttggtt ctagaacagg agga 24 



<210> 46 

<211> 66 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Probes and PCR 
Primers 

<400> 46 

catcttgttc aatggccgat cccattcctc ctcctcctcc actggtgaca aaacagagtc 60 
catgag 66 



<210> 47 
<211> 66 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Probes and PCR 
Primers 

<400> 47 

ctcatggact ctgttttgtc accagtggag gaggaggagg aatgggatcg gccattgaac 60 
aagatg 66 



<210> 48 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Probes and PCR 
Primers 

<400> 48 

caggtcgacg gatccgaaca aac 



<210> 49 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Probes and PCR 
Primers 

<400> 49 

cagctgtgtg ggtatgggag gg 



<210> 50 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Probes and PCR 
Primers 

<400> 50 

ctgaactgaa tgtttatcca ggccatc 



<210> 51 

<211> 27 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Probes and PCR 
Primers 

<400> 51 

agccgattgt ctgttgtgcc cagtcat 27 
* * * * * 

It is understood that certain adaptations of the description and illustrations provided in 
this disclosure are a matter of routine optimization for those skilled in the art, and can be 
implemented without departing from the spirit of the invention, or the scope of the appended 
claims. 
CLAIMS 

What is claimed as the invention is: 

1. Ovine tissue devoid of antibody-detectable Galα(1,3)Gal determinants. 

2. The tissue of claim 1, which is selected from the group consisting of lung tissue, kidney 
tissue, liver tissue, cardiac tissue, pancreatic tissue, and ocular tissue. 

3. Isolated ovine cell or tissue that expresses glycosyl transferase enzymes but does not 
detectably express α(1,3)galactosyltransferase (α1,3GT). 

4. An ovine cell which is heterozygous or homozygous for inactivation of an α1,3GT gene. 

5. The cell of claim 4, produced by transfer of a nucleus from a donor cell heterozygous or 
homozygous for inactivation of an α1,3GT gene, to an enucleated recipient cell. 

6. An ovine animal that is homozygous for inactivation of an α1,3GT gene. 

7. A polynucleotide construct effective for inactivating an α1,3GT gene in an ovine cell. 

8. The polynucleotide construct of claim 7, comprising at least two polynucleotide 
sequences from an ovine α1,3GT gene in a non-natural arrangement, for inactivating the 
α1,3GT gene by homologous recombination. 





9. The polynucleotide construct of claim 7, comprising an intron sequence of at least 30 
consecutive nucleotides found in any of recombinant phage B, C and G, deposited under 
Accession Nos. NCIMB 41056, 41059, 41060, and 41061. 

10. The polynucleotide construct of claim 7, comprising a selectable marker. 

11. The polynucleotide construct of claim 10, wherein the selectable marker is flanked on 
either side by polynucleotide sequences from an ovine α1,3GT gene. 

12. A method for inactivating an α1,3GT gene in an ovine cell, comprising contacting the 
cell with the polynucleotide construct of claim 7 under conditions suitable for 
homologous recombination of the construct into the genome of the cell. 

13. A method for producing an ovine that is homozygous for inactivation of an α1,3GT 
gene, comprising providing an ovine embryo of cells according to claim 4, engrafting the 
embryo into the uterus of a female, birthing an ovine with an inactivated α1,3GT gene 
from the engrafted female, and if the birthed ovine has the α1,3GT gene inactivated on 
only one allele, then mating it with another ovine with an inactivated α1,3GT gene, 
thereby producing an ovine that is homozygous for inactivation of the α1,3GT gene. 

14. A method for producing an isolated ovine cell that expresses glycosyl transferase 
enzymes but does not detectably express α1,3GT, comprising isolating the cell from an 
ovine homozygous for inactivation of an α1,3GT gene according to claim 6. 

15. A method for producing ovine tissue devoid of antibody-detectable Galα(1,3)Gal 
determinants, comprising harvesting the tissue from an ovine homozygous for 
inactivation of an α1,3GT gene according to claim 6. 

16. A method of xenotransplantation, comprising transplanting tissue devoid of 
antibody-detectable Galα(1,3)Gal determinants according to claim 1 into a mammal having 
circulating antibody against Galα(1,3)Gal determinants. 

17. An isolated polynucleotide that comprises a sequence of at least 30 consecutive 
nucleotides with at least one of the following properties: 

a) it is contained in SEQ. ID NO:1 or any of SEQ. ID NOs: 14 to 25, but not in 
any of SEQ. ID NOs: 3, 5, 7, 9, 11, and 13; 

b) it is contained in phage B, C and G deposited under Accession Nos. 
NCIMB 41056, 41059, 41060, and 41061; but not in λ-phage or any of SEQ. ID NOs: 
3, 5, 7, 9, 11, and 13; or 

c) it hybridizes under stringent conditions to a polynucleotide with the sequence 
in SEQ. ID NO:1 or any of SEQ. ID NOs: 14 to 25, but not to a polynucleotide with the 
sequence in any of SEQ. ID NOs: 3, 5, 7, 9, 11, and 13. 

18. The isolated polynucleotide of claim 17, wherein said sequence is contained in SEQ. ID 
NO:1 or any of SEQ. ID NOs: 14 to 25, but not in any of SEQ. ID NOs: 3, 5, 7, 9, 11, 
and 13. 

19. The isolated polynucleotide of claim 17, wherein said sequence is contained in phage B, 
C and G deposited under Accession Nos. NCIMB 41056, 41059, 41060, and 41061; 
but not in λ-phage or any of SEQ. ID NOs: 3, 5, 7, 9, 11, and 13. 

20. The isolated polynucleotide of claim 17, wherein said sequence hybridizes under 
stringent conditions to a polynucleotide with the sequence in SEQ. ID NO:1 or any of 
SEQ. ID NOs: 14 to 25, but not to a polynucleotide with the sequence in any of SEQ. 
ID NOs: 3, 5, 7, 9, 11, and 13. 

21. The polynucleotide of claim 17, which is a construct effective for inactivating an 
α(1,3)galactosyltransferase (α1,3GT) gene in an ovine cell by homologous 
recombination. 

22. An isolated polypeptide that comprises a sequence of at least 10 consecutive amino 
acids with at least one of the following properties: 

a) it is contained in SEQ. ID NO:2 but not in any of SEQ. ID NOs: 4, 6, 8, 10, 
and 12; 

b) it is encoded in phage B, C and G deposited under Accession Nos. 
NCIMB 41056, 41059, 41060, and 41061, but not encoded in λ-phage or present in 
any of SEQ. ID NOs: 4, 6, 8, 10, and 12; or 

c) it is at least 80% identical to 15 consecutive amino acids contained in SEQ. ID 
NO:2, wherein said sequence is not present in any of SEQ. ID NOs: 4, 6, 8, 10, and 12. 

23. The isolated polypeptide of claim 22, wherein said sequence is contained in SEQ. ID 
NO:2 but not in any of SEQ. ID NOs: 4, 6, 8, 10, and 12. 

24. The isolated polypeptide of claim 22, wherein said sequence is encoded in phage B, C 
and G deposited under Accession Nos. NCIMB 41056, 41059, 41060, and 41061, but 
not encoded in λ-phage or present in any of SEQ. ID NOs: 4, 6, 8, 10, and 12. 

25. The isolated polypeptide of claim 22, wherein said sequence is at least 80% identical to 
15 consecutive amino acids contained in SEQ. ID NO:2, wherein said sequence is not 
present in any of SEQ. ID NOs: 4, 6, 8, 10, and 12. 

26. The polypeptide of claim 22, which has glycosyltransferase activity. 

27. An isolated polynucleotide comprising a sequence encoding a polypeptide according to 
claim 22. 

28. An isolated polyclonal antibody or a monoclonal antibody that binds specifically to a 
polypeptide with the sequence SEQ. ID NO:2 but not to a peptide with the sequence 
present in any of SEQ. ID NOs: 4, 6, 8, or 10. 

29. An assay method for determining α1,3GT expression by a cell, comprising contacting a 
polynucleotide according to claim 17 with the cell or with mRNA or cDNA obtained 
from the cell, detecting any hybrids that form as a result, and correlating presence of the 
hybrids with expression of α1,3GT by the cell. 

30. A method for producing the antibody specific for sheep α1,3GT, comprising 
immunizing an animal or contacting an immunocompetent particle with a polypeptide 
according to claim 22. 

31. A method for preparing a Galα(1,3)Gal determinant, comprising contacting a galactose 
acceptor saccharide with the polypeptide of claim 26 in the presence of UDP-galactose. 

32. An assay method for determining α1,3GT in a sample, comprising preparing a reaction 
mixture comprising the sample and an antibody according to claim 28 under conditions 
that permit the antibody to complex with α1,3GT, and correlating any complex formed 
with the presence or amount of α1,3GT in the sample. 

ABSTRACT 

This invention provides immunologically compatible animal tissue suitable for 
xenotransplantation into human patients. The sequence of the cDNA for sheep 
α(1,3)galactosyltransferase has been determined, along with information about the flanking 
untranscribed regions and introns of the α1,3GT gene. This in turn has led to the design and 
implementation of targeting vectors capable of inactivating the α1,3GT gene by deleting or 
interrupting the protein coding region. Successfully targeted fibroblasts are used as nuclear 
donors for the cloning of animals in which the α1,3GT gene is inactivated. The Galα(1,3)Gal 
determinant does not form on endothelial and other cells of animals having homozygously 
inactivated α1,3GT. The tissues can be transplanted into human patients without being 
subject to the hyperacute rejection that would otherwise occur due to naturally occurring 
antibodies in human serum against the Galα(1,3)Gal determinant. Because human organs for 
transplantation are in short supply, immunologically compatible animal tissue is an 
important alternative for transplantation therapy. 

Figure 1(A) 



5'-AGCCGAGGACGCCGCCGGGGAGCCGAGGCTCCGGCCAGCCCCCAGCGCGCCCAGCT 



TCTGCAGATCAGGAGTCAGAACGCTGCACCTTCGCTTCCTCCCAGCCCTGCCTCCTTCTG 



CAAAACGGAGCTCAATAGAACTTGGTACTTTTGCCTTTTACTCTGGGAGGAGAGAAGCAG 



ACGATGAG | 4 | GAGAAAATAATGAATGTCAAAGGAAAAGTGATTCTGTCAATGCTGGTTG 

MNVKGKVILSMLVV 

TCTCAACTGTCATTGTTGTGTTTTGGGAATATATCCACAG | 5 | CCCAGAAGGCTCTTTGT 
STVIVVFWEYIHS PEGSLF 

TCTGGATAAACCCATCAAG | 6 | AAACCCAGAAGTCAGTGGCGGCAGCAGCATTCAGAAGG 
WINPSR NPEVSGGSSIQKG 

GCTGGTGGTTTCCGAGATGGTTTAACAATGG | 7 | TTACCAAGAAGAAGATGAAGACGTAG 
WWFPRWFNNG YQEEDEDVD 

ACGAAGAAAAGGAACAAAGAAAGGAAGACAAAAGCAAGCTTAAGCTATCGGACTGGTTCA 
EEKEQRKEDKSKLKLSDWFN 

ACCCATT | 8 | TAAACGCCCTGAGGTTGTGACTATGACAGATTGGAAGGCACCCGTGGTGT 
PF KRPEVVTMTDWKAPVVW 

GGGAAGGCACTTACAACAGAGCCGTCTTAGACGATTACTACGCCAAGCAGAAAATTACCG 
EGTYNRAVLDDYYAKQKITV 

TCGGCCTGACGGTTTTCGCCGTCGGAA | 9 | GATACATTGAGCATTACTTGGAGGAGTTCT 
GLTVFAVGR YIEHYLEEFL 

TAACGTCTGCTAATAAGCACTTCATGGTTGGCCACCGAGTCATCTTTTACGTCATGGTGG 
TSANKHFMVGHRVIFYVMVD 

ACGACGTCTCCAGGATGCCTTTGATAGAGCTGGGCCCTCTGCGCTCCTTCAAAGTGTTTG 
DVSRMPLIELGPLRSFKVFE 

AGGTCAAGCCTGAGAGGAGGTGGCAGGACGTCAGCATGGTGCGCATGAAGACCATCGGGG 
VKPERRWQDVSMVRMKTIGE 

AGCACATCGTGGCCCACATCCAGCGTGAGGTTGACTTCCTCTTCTGCATGGACGTGGACC 
HIVAHIQREVDFLFCMDVDQ 



Figure 1(B) 



AGGTCTTCCAAGACGAGTTCGGGGTGGAGACCCTGGGTGAGTCGGTGGCCCAGCTACAGG 
VFQDEFGVETLGESVAQLQA 

CCTGGTGGTACAAGGCAGATCCCGATGAGTTTACCTACGAGAGGCGCAAGGAGTCTGCAG 
WWYKADPDEFTYERRKESAA 

CATACATTCCCTTCGGCGAAGGGGATTTTTATTACCACGCAGCCATTTTTGGGGGAACAC 
YIPFGEGDFYYHAAIFGGTP 

CCACTCAGGTCCTTAACATCACCCAGGAATGCTTCAAAGGAATCCTCAAGGACAAGAAAA 
TQVLNITQECFKGILKDKKN 

ATGACATAGAAGCCCAATGGCATGATGAGAGCCATCTAAACAAGTATTTCCTTCTCAACA 
DIEAQWHDESHLNKYFLLNK 

AACCCACTAAAATCTTATCCCCGGAATACTGCTGGGATTATCATATAGGCCTACCTGCGG 
PTKILSPEYCWDYHIGLPAD 

ATATTAAGCTTGTCAAGATGTCTTGGCAGACAAAAGAGTATAATGTGGTTAGAAATAACG 
IKLVKMSWQTKEYNVVRNNV 

TCTGAAGCCGAGGACGCCGCCGGGGAGCCGAGGCTCCGGCCAGCCCCCAGCGCGCCCAGC 



TTCTGCAGATCAGGAGTCAGAACGCTGCACCTTCGCTTCCTCCCAGCCCTGCCTCCTTCT 



GCAAAACGGAGCTCAATAGAACTTGGTACTTTTGCCTTTTACTCTGGGAGGAGAGAAGCA 



GACGATGAGGAGAAAATA 



Figure 2(A) 



sheep 

bovine CCGGGGGCCGGGCCGAGCTGGGAGCGTCGAGCCCGCTGCCCAGCGCCCGCCGGCTCCCTC 

sheep 

bovine GCGCCCCTGCCCGCCGCCCCGGAGGAGCGCCCGGCGGCCGGCCGACGGGAGCGCAGCGGC 

sheep 

bovine ACACCCCGCCCCGGCACGCCCGCGGGGCTCGGGAGGAGGCAGCGCGCCGACTGTTCCGGC 

sheep AGCCGAGGACGCCGCCGGGGAGCCGAGGCTCCGGCCAGCCCCCAGCGCGCCCAGCTTCTG 

bovine AGCCGAGGACGCCGCCGGGGAGCCGAGGCGCCGGCCAGCCCCCAGCGCGCCCAGCTTCTG 

***************************** ****************************** 

sheep CAGATCAGG 

bovine CGGATCAGGGAAACCACGTGTCCTCAAGTGGCCAGCCAGCTGTCCCCAAGAGGAACTTGC 

* ******* 

sheep AGTCAGAACGCTGCAC 

bovine CTGGCATTTGCACGGAAAGACGAGACACTTCACAAAATCAACGGAGTCAGAACGCTGCAC 

******** ******* 

sheep CTTCGCTTCCTCCCAGCCCTGCCTCCTTCTGCAAAACGGAGCTCAATAGAACTTGGTACT 

bovine CTTCGCTTCCTCCCAGCCCTGCCTCCTTCTGCAGAACGGAGCTCAGTAGAACTTGGTACT 

********************************* *********** ************** 

a 

sheep TTTGCCTTTTACTCTGGGAGGAGAGAAGCAGACGATGAGGAGAAAATAATGAATGTCAAA 

bovine TTTGCCTTTTACTCTAGGAGGAGAGAAGCAGACGATGAGGAGAAAATAATGAATGTCAAA 

*************** ******************************************** 

sheep GGAAAAGTGATTCTGTCAATGCTGGTTGTCTCAACTGTCATTGTTGTGTTTTGGGAATAT 

bovine GGAAAAGTGATTCTGTCAATGCTGGTTGTCTCAACTGTCATTGTTGTGTTTTGGGAATAT 

************************************************************ 

sheep ATCCACAGCCCAGAAGGCTCTTTGTTCTGGATAAACCCATCAAGAAACCCAGAAGTCAGT 

bovine ATCCACAGCCCAGAAGGCTCTTTGTTCTGGATAAACCCATCAAGAAACCCAGAAGT T 

******************************************************** * 

sheep GGCGGCAGCAGCATTCAGAAGGGCTGGTGGTTTCCGAGATGGTTTAACAATGGTTACCAA 

bovine GGTGGCAGCAGCATTCAGAAGGGCTGGTGGCTTCCGAGATGGTTTAACAATGGTTACCAT 

** *************************** **************************** 

sheep GAAGAAGATGAAGACGTAGACGAAGAAAAGGAACAAAGAAAGGAAGACAAAAGCAAGCTT 

bovine GAAGAAGATGGAGACATAAACGAAGAAAAGGAACAAAGAAACGAAGACGAAAGCAAGCTT 

********** **** ** ********************** ****** *********** 

sheep AAGCTATCGGACTGGTTCAACCCATTTAAACGCCCTGAGGTTGTGACTATGACAGATTGG 

bovine AAGCTATCGGACTGGTTCAACCCATTTAAACGCCCCGAGGTTGTGACCATGACGAAGTGG 

*********************************** *********** ***** * *** 

sheep AAGGCACCCGTGGTGTGGGAAGGCACTTACAACAGAGCCGTCTTAGACGATTACTACGCC 

bovine AAGGCTCCAGTGGTGTGGGAAGGCACTTACAACAGAGCCGTCTTAGACAATTATTATGCC 

***** ** *************************************** **** ** *** 

sheep AAGCAGAAAATTACCGTCGGCCTGACGGTTTTCGCCGTCGGAAGATACATTGAGCATTAC 

bovine AAGCAGAAAATTACCGTCGGCCTGACGGTTTTCGCCGTCGGAAGATACATTGAGCATTAC 

************************************************************ 

sheep TTGGAGGAGTTCTTAACGTCTGCTAATAAGCACTTCATGGTTGGCCACCGAGTCATCTTT 

bovine TTGGAGGAGTTCTTAACGTCTGCTAATAAGCACTTCATGGTGGGCCACCCAGTCATCTTT 

***************************************** ******* ********** 

sheep TACGTCATGGTGGACGACGTCTCCAGGATGCCTTTGATAGAGCTGGGCCCTCTGCGCTCC 

bovine TATATCATGGTAGATGATGTCTCCAGGATGCCTTTGATAGAGTTGGGTCCTCTGCGCTCC 

** ******* ** ** ************************ **** ************ 

sheep TTCAAAGTGTTTGAGGTCAAGCCTGAGAGGAGGTGGCAGGACGTCAGCATGGTGCGCATG 

bovine TTCAAAGTGTTTAAGATCAAGCCTGAGAAGAGGTGGCAGGACATCAGCATGATGCGCATG 

************ ** ************ ************* ******** ******** 

sheep AAGACCATCGGGGAGCACATCGTGGCCCACATCCAGCGTGAGGTTGACTTCCTCTTCTGC 

bovine AAGACTATCGGGGAGCACATTGTGGCCCACATCCAGCATGAGGTTGACTTCCTTTTCTGC 

***** ************** **************** *************** ****** 

sheep ATGGACGTGGACCAGGTCTTCCAAGACGAGTTCGGGGTGGAGACCCTGGGTGAGTCGGTG 

bovine ATGGATGTGGACCAGGTCTTCCAAGACAAGTTTGGGGTGGAGACCCTGGGCGAGTCGGTG 

***** ********************* **** ***************** ********* 

sheep GCCCAGCTACAGGCCTGGTGGTACAAGGCAGATCCCGATGAGTTTACCTACGAGAGGCGC 

bovine GCCCAGCTACAAGCCTGGTGGTACAAGGCAGATCCCAATGACTTCACCTACGAGAGGCGG 

*********** ************************ **** ** ************** 

sheep AAGGAGTCTGCAGCATACATTCCCTTCGGCGAAGGGGATTTTTATTACCACGCAGCCATT 

bovine AAGGAGTCTGCAGCATACATTCCCTTCGGCGAAGGGGATTTTTATTACCATGCAGCCATT 

************************************************** ********* 

sheep TTTGGGGGAACACCCACTCAGGTCCTTAACATCACCCAGGAATGCTTCAAAGGAATCCTC 

bovine TTTGGGGGAACACCCACTCAGGTCCTTAACATCACCCAGGAATGCTTCAAAGGAATCCTC 

************************************************************ 

sheep AAGGACAAGAAAAATGACATAGAAGCCCAATGGCATGATGAGAGCCATCTAAACAAGTAT 

bovine AAGGACAAGAAAAATGACATAGAAGCCCAATGGCATGATGAAAGCCATCTAAACAAGTAT 

***************************************** ****************** 

sheep TTCCTTCTCAACAAACCCACTAAAATCTTATCCCCGGAATACTGCTGGGATTATCATATA 

bovine TTCCTTCTCAACAAACCTACTAAAATCTTATCCCCGGAATACTGCTGGGATTATCACATA 

***************** ************************************** *** 

sheep GGCCTACCTGCGGATATTAAGCTTGTCAAGATGTCTTGGCAGACAAAAGAGTATAATGTG 

bovine GGCCTACCTGCGGATATTAAGCTTGTCAAGATGTCTTGGCAGACAAAAGAGTATAATGTG 

************************************************************ 

sheep GTTAGAAATAACGTCTGA 

bovine GTTAGAAATAATGTCTGACTTTGTGCCAGTACATTTCTGAATTTGAGAGAGTATTATTCT 

*********** ****** 



Figure 3 



1 50 

bovine MNVKGKVILS MLVVSTVIVV FWEYIHSPEG SLFWINPSRN PEV.GGSSIQ 

sheep MNVKGKVILS MLVVSTVIVV FWEYIHSPEG SLFWINPSRN PEVSGGSSIQ 

********** ********** ********** ********** *** ****** 

51 100 

bovine KGWWLPRWFN NGYHEEDGDI NEEKEQRNED ESKLKLSDWF NPFKRPEVVT 

sheep KGWWFPRWFN NGYQEEDEDV DEEKEQRKED KSKLKLSDWF NPFKRPEVVT 

**** ***** *** *** * ****** ** ********* ********** 

101 150 

bovine MTKWKAPVVW EGTYNRAVLD NYYAKQKITV GLTVFAVGRY IEHYLEEFLT 

sheep MTDWKAPVVW EGTYNRAVLD DYYAKQKITV GLTVFAVGRY IEHYLEEFLT 

** ******* ********** ********* ********** ********** 

151 200 

bovine SANKHFMVGH PVIFYIMVDD VSRMPLIELG PLRSFKVFKI KPEKRWQDIS 

sheep SANKHFMVGH RVIFYVMVDD VSRMPLIELG PLRSFKVFEV KPERRWQDVS 

********** **** **** ********** ******** *** **** * 

201 250 

bovine MMRMKTIGEH IVAHIQHEVD FLFCMDVDQV FQDKFGVETL GESVAQLQAW 

sheep MVRMKTIGEH IVAHIQREVD FLFCMDVDQV FQDEFGVETL GESVAQLQAW 

******** ****** *** ********** *** ****** ********** 

251 300 

bovine WYKADPNDFT YERRKESAAY IPFGEGDFYY HAAIFGGTPT QVLNITQECF 

sheep WYKADPDEFT YERRKESAAY IPFGEGDFYY HAAIFGGTPT QVLNITQECF 

****** ** ********** ********** ********** ********** 

301 350 

bovine KGILKDKKND IEAQWHDESH LNKYFLLNKP TKILSPEYCW DYHIGLPADI 

sheep KGILKDKKND IEAQWHDESH LNKYFLLNKP TKILSPEYCW DYHIGLPADI 

********** ********** ********** ********** ********** 

351 370 

bovine KLVKMSWQTK EYNVVRNNV* 

sheep KLVKMSWQTK EYNVVRNNV* 
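
The asterisk line under each block of the Figure 3 alignment marks positions where the bovine and sheep residues are identical. As an illustrative aid only, the following Python sketch shows one way such a match line and a percent-identity figure could be recomputed from a pair of aligned sequences. The function names `match_line` and `percent_identity` are hypothetical. The treatment of the "." gap as a mismatch is an assumption, as is the use of the number of alignment columns as the denominator. The example data are residues 1 to 50 as shown in Figure 3, with "VV" read at positions 19 and 20 in agreement with the translation in Figure 1(A).

```python
def match_line(seq_a: str, seq_b: str, gap: str = ".") -> str:
    """Return '*' for each column where both aligned sequences carry the
    same residue, and ' ' otherwise. Gap columns count as mismatches
    (an assumption of this sketch)."""
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have equal length")
    return "".join(
        "*" if a == b and a != gap else " "
        for a, b in zip(seq_a, seq_b)
    )


def percent_identity(seq_a: str, seq_b: str, gap: str = ".") -> float:
    """Identical columns as a percentage of all alignment columns."""
    line = match_line(seq_a, seq_b, gap)
    return 100.0 * line.count("*") / len(line)


# Residues 1-50 of the Figure 3 alignment, with block spacing removed.
# '.' marks the single-residue gap in the bovine sequence.
BOVINE_1_50 = "MNVKGKVILSMLVVSTVIVVFWEYIHSPEGSLFWINPSRNPEV.GGSSIQ"
SHEEP_1_50 = "MNVKGKVILSMLVVSTVIVVFWEYIHSPEGSLFWINPSRNPEVSGGSSIQ"

print(BOVINE_1_50)
print(SHEEP_1_50)
print(match_line(BOVINE_1_50, SHEEP_1_50))
print(percent_identity(BOVINE_1_50, SHEEP_1_50))
```

The same functions could be applied to the nucleotide alignment of Figure 2 by passing the gap character used there. Other identity conventions, such as excluding gap columns from the denominator, would give slightly different percentages.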